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Introduction to Block C 


This applies particularly to 
Chapters BI and B2. 


The Latin word calculus 
means a stone. The link is in 
the use of stones for counting. 


The Summary of Block A, at the end of Chapter A3, described the 
difference between discrete models and variables, on the one hand, and 
continuous models and variables on the other. In much of Block B you 
used ideas from discrete mathematics, such as recurrence relations and 
sequences, to model phenomena that involve change. 


In Block C you will again be modelling change, but this time in cases 
where the change is continuous or can reasonably be modelled as being so. 
Block C addresses questions such as the following: How fast is something 
changing? Is it increasing or decreasing? Does it ever stop changing, 
permanently or temporarily? How does the change accumulate? How can 
the change be described by an equation? 


Chapter Cl concerns the rate at which variables change, whether the 
change is an increase or decrease and whether the change stops. A 
mathematical process called differentiation is introduced to help in 
analysing these matters, together with some illustrations of its practical 
applications. 


Chapter C2 takes a different view towards change. The process of 
integration is introduced, as the reverse process of differentiation. 
Integration permits an accumulation of continuous changes to be described, 
and one significant application is to finding areas beneath graphs. 


Integration also features importantly in Chapter C3. Here it is needed in 
order to solve a new type of equation, called a differential equation, which 
arises frequently in mathematical models of phenomena which change. 


The branch of mathematics which includes the study of differentiation and 
integration is called calculus, so Block C provides an introduction to 
calculus. The history of this branch of mathematics goes back to the 
second half of the seventeenth century, when Sir Isaac Newton (1642-1727) 
in England and Gottfried Wilhelm Leibniz (1646-1716) in what is now 
Germany both developed independently the basic ideas of calculus. The 
word ‘calculus’ here means ‘a systematic method for solving a certain type 
of problem’, and what Newton and Leibniz did was not so much to 
discover the basic ideas as to bring them together in a systematic way and 
then show how their systematic methods could be used to solve problems. 


Unfortunately, a bitter argument grew up over who was the first to 
‘discover’ calculus. This split the mathematicians of the day (and for 
decades afterwards) into two camps and hindered the development of the 
subject. In fact, one legacy of the dispute is still with us: there are several 
notations in current use for expressing exactly the same thing in calculus. 
One set of notations can be traced back to Newton’s work and another to 
that of Leibniz. Other notations arose from more recent attempts to ‘tidy 
up’ the subject, following on from the greater understanding of calculus 
and its associated ideas that has developed since the time of Newton and 
Leibniz. You need to be able to recognise, understand and occasionally use 
all of these notations, even though you will probably settle eventually for 
using mainly one of the possible alternatives. 


Many branches of mathematics, science and technology have made use of 
calculus, which indicates its central importance and usefulness. The 
chapters of this block refer to a range of contexts in order to illustrate the 
applications of calculus. 


There are five sections in this chapter. They are 
intended to be studied consecutively in five study 
sessions. Each section requires two to three hours. 


The pattern of study for each session might be as 
follows. 

Study session 1: Section 1. 

Study session 2: Section 2. 

Study session 3: Section 3. 

Study session 4: Section 4. 

Study session 5: Section 5. 

Section 2 is the longest and Section 3, which 
contains some material that can be omitted if you 
are short of time, is the shortest. 

Subsection 1.2 requires the use of an audio cassette 
player, and Subsection 3.3 requires the use of a 
video cassette player. Section 5 requires the use of 
the computer and Computer Book C. 


The optional Video Band C(i), Algebra 
workout — Differentiation, could be viewed at 


any stage during your study of this chapter. 


Introduction 


The use of a function to 
represent the relationship 
between a dependent variable 
and a single independent 
variable was introduced in 
Chapter A3, Subsection 1.1. 


Polynomial functions were 
introduced in Chapter A3, 
Section 5. 


This chapter introduces an important topic in calculus called 
differentiation. If a dependent variable y is related to a particular 
independent variable x, which can vary continuously, then differentiation 
enables us to find the rate at which y changes as x changes. Put another 
way, this rate is the gradient of the graph defined by the equation y = f(x) 
that relates the two variables. The expression for the rate of change in 
terms of x is called the derivative of f(x). 


Suppose, for example, that the motion of a car along a straight road is 
described by expressing its position relative to a fixed starting point in 
terms of the elapsed time. Then differentiation gives the rate of change of 
the car's position with time, which indicates just how fast the car travels 
during its journey. This rate of change, known as the velocity of the car, 
may itself vary with time. Differentiation permits the ‘velocity function’ of 
the car to be obtained directly from its ‘position function’, where the 
independent variable is time in each case. 


Section 1 defines the process of differentiation, illustrating its application 
first to quadratic functions and then to power functions. In Section 2, 
differentiation is applied to polynomial functions, following which you will 
see how differentiation can be employed to find the maximum or minimum 
value attained by a function. The process of finding these values is called 
optimisation, and is of use both for modelling purposes and for permitting 
the graphs of functions to be sketched quickly. While all the examples 
considered here involve polynomial functions, the principles apply also to 
many other types of function. 


In Section 3, differentiation is applied to a wider range of functions, 
namely the trigonometric functions sin and cos, the exponential function 
exp and the natural logarithm function In. The results are incorporated 
into a table giving the derivatives of important basic functions, to which 
are added two simple rules for differentiating functions defined as sums or 
constant multiples of those in the table. The alternative Leibniz notation 
for derivatives is described. 


Section 4 gives three further rules for differentiation, which apply to 
functions that are defined as products, quotients and composites of the 
more basic functions considered earlier. The results in Sections 3 and 4 
permit the differentiation of a very wide range of functions. 


In Section 5 you will see how optimisation, based on differentiation, can be 
applied with the help of the computer, in the context of modelling various 
real-life problems. 


1 What is differentiation? 


To study Subsection 1.2 you will need an audio cassette player and Audio 
Tape 3. 

In this section you will see how the idea of the gradient of a straight line 
can be extended to apply to any smooth graph. After an informal 
introduction in Subsection 1.1, you will see in Subsection 1.2 how to find 
the gradient for any quadratic graph. Subsection 1.3 formalises the process 
of finding the gradient for a curve, which is known as differentiation, and 
shows how this process can be applied to power functions. 


1.1 Gradients of curves 


A linear function which relates the dependent variable y to the 
independent variable x has a graph with equation of the form 
y=mrt+e. 
Here m is the gradient or slope of the graph, which represents the ratio of 
any change in y to the corresponding change in x. In other words, we have 
change in y (= rise ) 
corresponding change in x run/ * 


m= 


as illustrated in Figure 1.1, for m > 0. 


=mrt+ec 


RY 


Figure 1.1 Gradient m of a linear function 


The graph of a linear function is equally steep at all points and so has 
constant gradient. This is not true for the graphs of non-linear functions. 
For example, the graph of the function f(x) = 2? is ‘flat’ at the origin and 
becomes steeper as you move away from the origin to the left or right, as 
shown in Figure 1.2. 


In general, we need to talk about the gradient or slope of a graph at a 
point on the graph. This is the gradient of the unique line that ‘touches’ 
the graph at the point, as shown in Figure 1.3(a) overleaf. This line, the 
one which most closely approximates the graph of y = f(x) near the point, 
is called the tangent to the graph at the given point. 


Linear functions were 
introduced in Chapter A3, 
Subsection 1.2. The equations 
of lines and the definition of 
the slope or gradient of a line 
were given in Chapter A2, 
Subsection 1.1. 


If y decreases as « increases, 
then m is negati If y does 
not change as x increases, 
then m = 0. 
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Figure 1.2. Graph of y = x” 


The word ‘tangent’ comes 
from the Latin verb tangere. 
meaning to touch. 


One way to visualise the 
tangent at a point on the 
graph in Figure 1.3(a) is as 
follows. Think of 
progressively ‘zooming in’ on 
the portion of the graph 
which contains the particular 
point. This causes the 
portion of graph focused on 
to look straighter and 
straighter. Eventually it will 
look indistinguishable from a 
line segment, which is part of 
the tangent to the graph 
through the point. 


Figure 1.3 
at (0,0) 


(a) Tangent to a graph at a point (b) No tangent to the graph 


You will see in this chapter that, for many functions f, the graph of f has 
a tangent at every point. Informally, a function is said to be smooth if it 
has this property. Any linear function is certainly smooth, since the 
tangent at each point coincides with the original straight-line graph. On 
the other hand, the function f(x) = || is not smooth. The graph of this 
function is shown in Figure 1.3(b). Zooming in on the point (0,0) will 
always give a V-shape for the graph rather than a line segment. Hence 
there is no tangent to the graph at this point. 


The gradients of tangents can often be expressed by a formula related to 
the rule of the function f. Differentiation is the process of finding this 
formula for all the tangent gradients, starting from the rule of the original 
function. 


For the line of Figure 1.1, you can think of the gradient as a measure of 
how fast variable y is changing as variable x changes: a large gradient 
corresponds to a rapid change of y with respect to x, and a small gradient 
corresponds to a slow change of y with respect to x. The gradient of the 
line represents the rate of change of y with respect to x. 


Similar remarks apply for graphs which are curved, such as the one in 
Figure 1.3(a), though here the rate of change of y with respect to x, being 
equal to the gradient at each point, varies as the point moves along the 
curve. The ‘rate of change’ relationship between variables is important in 
applications, and it is differentiation which enables us to find this rate of 
change at any point. You will see shortly how this process is carried out. 


Differentiation: informal description 

Differentiation is a process which enables you to find 

© the gradient of a graph: 

© the rate at which one variable changes with respect to another. 


1.2 Gradients of quadratic graphs 

In the audio tape that follows, you will see that all quadratic functions are 
smooth and how to calculate the gradient at any point on any quadratic 
graph. 

Now listen to Audio Tape 3, Band 1, ‘Gradients of quadratic graphs’. 


SECTION 1 WHAT IS DIFFERENTIATION? 


The graph of y = x? 


Chord line segment 
joining two points 
on a curve: 


gradient = ee 
run 


a 
1A a : For BD, 
rise_ 4-1 


run 2-1 


oe See tangent at B 


What is the gradient 
of the tangent. 
at B? 


Chord | OB AB BC BD 
Gradient ee. «86S 


Gradient of tangent at (1, 1) 
Gradient of chord with endpoints 
(1,1) and (1+ (14 h)*) is 


rise _(1+h)°=1 
run h 


14+2h4h-1 (ent 


When h is small: 


9 gradient of chord is close to gradient 
of tangent at (1,1); a) 


0 2+ his close to 2. 


So gradient of tangent at (1,1) is 2. 
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Gradient formula for f (x) = a 


Gradient of y = f(x) at (x, f(x)): 
Fixjsee 


gradient of tangent 
at (x, f(x)) 


gradient: 2ax gradient: b gradient: O 


General quadratic 
gradient formula 


Gradient of y = f(x) at (x, f(x)): 
f(x) = 2ax+ b. 


Example ; 
Let f(x) = 3x*-5x+2. a 
Find the gradient of y = f (x) at F Qand R. 


f)=ex—5 
Gradient at P: 6xO-5=-5 
Gradient at @ 6*I)-5 = | 
Gradient at R: (6x2)-5 se 
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In Frame 4 of the audio tape it was stated that the gradient formula for a 
general quadratic function is given as: 


if f(x) =ax? +br +e, then f'(x) =2axr +b, (1.1) As mentioned on the andio 
tape, f’(2) is pronounced as 


where a. 6 and ¢ are constants. This gives the value of the gradient of the ‘f dash of x’ or ‘f prime of x’. 


graph of y = f(a) at the point (x. f(x)), which is also the rate of change of 
y with respect to x at this point. 


The following activities ask you to apply the quadratic gradient formula 
from result (1.1). 


Use result (1.1) to write down an expression for f’(zr) in each of the 
following cases. 

(a) f(z) =2?+24+7 

(b) f(z) = 52? —2 

(c) f(z) =5+ 3x - 32 

Solutions are given on page 54. 


Use result (1.1) to find the gradient at the point given on the graph of 
y = f(x) in each of the following cases. 


(a) f(@@)=27, (3,5 

(b) f(x) =—-a? +1, (3,-8) 

(c) f(x) = 32? + 32x — 242, (1, -207) 
Solutions are given on page 54. 


In this argument, regard x as 
fixed while h varies. The 
quantity h can be positive or 
negative, but not zero. 
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CHAPTER C1 DIFFERENTIATION AND MODELLING 


In Frame 2 of the audio tape you saw why the gradient of the function 
f(x) =<? at the point B(1,1) is 2, and it was asserted in Frame 3 that the 
gradient at a general point (x, f(x)) is given by 

f'(x) = 2z. (1.2) 
We now justify this assertion, using reasoning similar to that in Frame 2. 
Figure 1.4 shows the graph of y = x’, with a particular point («, x?) 
marked and a chord with one endpoint at (x, x). 


uA 


(+h, (x+h)?) 


Figure 1.4 Finding the gradient for y = «? 
at a general point on its graph 


The other endpoint of the chord is at (2 +h, (a + h)*), a nearby point on 
the graph. For small values of h, the gradient of the chord is close to the 
gradient of the tangent at («,2?). The gradient of the chord is 
rise (a +h)? — 2? 
run («+h)—& 
_ 2? 42ch+h? - 2? 
— 
_ 2ah+h? 
SS 
=2r+h. 


Now the smaller h becomes, the closer the chord gradient is to the tangent 
gradient, and the closer 2x + h is to 2x, so we conclude that the gradient 
of the tangent at (x, 2”) is 27, which confirms equation (1.2). 


The argument to establish the gradient formula for the general quadratic 
function f(a) = ax? + br +c, as given by result (1.1), is very similar to 
that just employed to verify formula (1.2) for the special case where a = 1, 
b=0 and c=0. Moreover, the approach adopted here, of first evaluating 
the gradient of a chord from (x, f(x)) to (a +h, f(a +h)) and then ‘taking 
the limit’ as h tends to zero, can be applied much more generally. This is 
taken up in the next subsection. 


SECTION 1 WHAT IS DIFFERENTIATION? 


1.3 Differentiation of a function 


A formula for the gradient of the graph of a general function can be found 
by applying the same approach as was used at the end of Subsection 1.2 
for the particular case f(x) = x?. 


Suppose that f is any smooth function, so the tangent to its graph exists 
at all points. Figure 1.5 shows part of this graph, with a chord marked 
between the two points (x, f(x)) and (2+ h, f(z + h)). 


ua 


f(2@+h) — f(z) 


angen' 


ti it 
(x, f(z) at (x, f(x) 


> 
z 


oO x z+h 


Figure 1.5 Finding the gradient for y = f(x) at a general 
point on its graph 


This chord has gradient Once again, regard x as fixed 
rise _ f(a+h) — f(z) wile Late The quantity 
plane LE ae positive or negative, 
run h but not zero. 


Since f is a smooth function, its graph has a tangent at (a, f(x)), and the 
gradient of this tangent, denoted by f’(x), must be the limit of the chord 
gradient values as h tends to zero. In other words, we have 


f(z) = lim (ffs) 5 (1.3) The symbol jim is read as 


‘the limit as h tends to 0’. 
We call f'(x), as defined by this formula, the derivative of f at the 
point x. For example, for the function f(.) = 2?, where as you have seen 
J'(x) = 22, we have: 
© the derivative of f at 1 is f’(1) = 2; 
© the derivative of f at 0 is f’(0) =0. 
In general, the derivative f’(x) depends both on the function f and on the 
point x. Hence the derivative defines a function, called the derived 
function of f and denoted by f’. For example, for the function f(x) =x?, — Often the word ‘derivative’ is 
the derived function f’ is defined by f'(x) = 2x. also used to refer to the 


derived function f’. 
The process of finding the derived function or derivative of a given ee se! 


function f is called differentiation. 
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The verb associated with 
differentiation is ‘to 
differentiate’. The meaning of 
this verb in calculus is 
therefore not the same as its 
meaning in everyday English 
(where it means ‘to 
distinguish between’ or ‘to 
make different’), 


Using this definition directly 
is also called ‘differentiating 
from first principles’. 
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| CHAPTER C1 DIFFERENTIATION AND MODELLING 


° 


Differentiation, derivative and derived function 
Let f be a function. 


The derivative f’(x) at a point x in the domain of f is the 
gradient of the graph of f at (x. f(x)), given by 
(£ +h) - —) 
h ’ 
provided that this limit exists. 


f(z) = lim 


h—0 


(1.3) 


If the derivative exists at each point of the domain of f, then we 
say that f is smooth. 

The process of finding the derivative f’(x) is called 
differentiation. 

The function f’ defined by the process of differentiation is called 
the derived function of f. 


If y= f(x), then f’(x) is the rate of change of y with respect 
tox. 


A word on domains is in order. The main emphasis in this course is on 
finding the rule for the derived function, f’, of f. However, f’ must also 
have a domain and this consists of those points in the domain of f where 
the derivative exists. In particular, if f is a smooth function, then the 
domain of f‘ is the same as that of f. 


Depending on the particular purpose which you have in differentiating, you 
can regard the derivative as giving the gradient of the graph of the original 
function or the rate of change of the dependent variable with respect to the 
independent variable. Both interpretations are useful and widely applied. 


So far you have seen how to differentiate quadratic functions. There are 
many types of smooth functions. In Sections 3 and 4 you will see the 
derived functions for a range of basic functions. together with some rules 
for finding derived functions for a wide class of functions. 


Other than in this section, we shall not ask you to spend time in justifying 
these various results formally, since this is of secondary importance 
compared to being able to apply them appropriately. Such justifications 
involve starting in each case from definition (1.3), and we now look briefly 
at what this involves in a few simple cases. 


Use equation (1.3) to differentiate the function f(x) = 2°. 


Solution 


By equation (1.3), we need to consider the quotient 


f(w+h)— f(a) _ (w@+h)—23 


h 7 h - 


where h is a non-zero number. 


SECTION 1 WHAT IS DIFFERENTIATION? 


Now we have 
(a +h) = (a+h)(x+h)? 


= (x+h)(x? + 2rh +h?) 
= 2° +3a7h + 3ch? + hi, 
so the quotient becomes 
f(e+h)— f(@) _ 2° +32%h + 3ch? +h? — 28 Since h #0, the possibility of 
h h sion by zero does not 


_ 37h + 3ch? + h® 
om h 
= 3x" + 30h +h’. 


For small values of h, the expression 3c? + 3h + h? is close to 3x? because 
both 3h and h? are small. Thus in the limit as h 0, we obtain 


f'(z) = 32. 


The result of Example 1.1 can be applied to give the gradient at any point 
on the graph of y = x*. For example, the points (1,1) and (—1,—1) both 
lie on y = x*, and the gradients at these points are respectively 


f'(1)=3 and f’(-1)=3. 


The tangents at these points are shown in Figure 1.6, together with the 
tangent to y = 2? at (1,1), which has gradient 2, for comparison. 


Figure 1.6 Gradients of tangents to the graphs of y = x* 
and y= 2 
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Note that if 2 = }, then 
y = (4)* = 4. Hence the 
point (4, 75) lies on the 
graph. 


Recall that x° = 1 
and x! =z, 


The domain of a power 
function depends on the value 
of n. If n is a non-negative 

integer, then the domain is R. 


Recall that 1/2? = 2~* and 


Vr=x'/?, 


This derived function gives 
the instantaneous velocity of 
the car, which is what would 
be recorded at each time t on 
its speedometer. 
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CHAPTER C1 DIFFERENTIATION AND MODELLING 


Activity 1.3 Differentiation from the definition 
(a) Use the expansion for (2 + h)® from Example 1.1 to find the expansion 
for (x +h)! =(x+h)(x +h). 


(b) Use equation (1.3) and the result of part (a) to differentiate the 
function f(a) = 2+. 


(c) What is the gradient of the graph of y = x at the point (3, 75)? 
Solutions are given on page 54. 


You have now seen it established that: 
© if f(x) =27, then f’(x) = 2x (Subsection 1.2): 
© if f(x) = 2°, then f(x) = 3x? (Example 1.1); 
© if f(x) =2*, then f(x) = 4r° (Activity 1.3). 
In addition, you know that if f(x) = x then f’(x) = 1, since the line y = x 
has gradient 1, and that if f(x) = 1 then f’(x) = 0 because the line y = 1 
is horizontal and hence has gradient zero. These results can be 
summarised in the single statement 

if f(z) =2", then f'(z)=nz"" for n =0,1,2,3,4. 
In fact, this statement holds not just for these particular values but for any 
real value of n, so we can write 


if f(z) = 2", then f'(r) =nzx""', (1.4) 


where n is any real number. Functions of the form f(x) = x” are called 
power functions, so result (1.4) gives the rule for differentiating any 
power function. This result is often expressed as 


the derivative of 2” is na’ 


Activity 1.4 Differentiation from a formula 


Use result (1.4) to differentiate each of the following functions. 
(a) f(z)=2° — (b) f(z)=1/2*_— (ec) f(z)= Ve («> 0) 


Solutions are given on page 54. 


The concept of differentiation applies equally to situations where variables 
other than « and y are involved. In the course of mathematical modelling 
it will often be the case that other variable labels occur. For example, if s 
(in metres) is the position of a car moving along a straight road, measured 
from a fixed starting point, and ¢ (in seconds) is the time since leaving the 
starting point, then the particular motion of the car may be described by 

s = f(t) for some smooth function f. The derivative of this function gives 
the rate of change of the car’s position with time, which is its velocity 

v= f'(t), measured in metres per second (ms~'). 


ie SECTION 1 WHAT IS DIFFERENTIATION? 


In fact, this example permits a physical interpretation of the 
definition (1.3) for the derivative. If s = f(t), then 


f(t +h) — f(t) 
h 
is the average velocity (distance travelled divided by time passed) over the 
interval between time t and time t + h (assuming h > 0). The 
instantaneous velocity v = f’(t) at time t is the limit of such average 
velocities as the duration h of the time interval tends to zero. 


Example 1.2 Finding the velocity 


A car moving along a straight road has position s = 30t — t? metres, as 
measured in a given direction from a fixed point O, at time t seconds after 
it passes O. 

(a) Find the corresponding velocity v (in ms~') of the car at time ¢. 

(b) What is the velocity of the car as it passes O? 

(c) At what time does the car have velocity zero? 


Solution 
(a) In this case the position function is s = f(t) = 30 — t?. This is a 
quadratic function (of t), so we may apply result (1.1) to obtain the 
velocity function 
v= f(t) =30—2t. 
(b) The car passes O when t = 0, at which time we have v = f’(0) = 30, so 
the velocity is then 30 ms~'. 


(c) When v = 0, we have 30 — 2t = 0, which has solution t = 15. Hence 
the car has velocity zero (is at rest, momentarily) after 15 seconds. 


Activity 1.5 Finding the velocity 


A stone dropped from rest has position s = 5t? metres below its starting 
point after ¢ seconds. 


(a) Find the velocity v (in ms~') of the stone at time t. 
(b) What is the velocity of the stone after 3 seconds? 
(c) At what time does the stone have velocity 15 ms7!? 
Solutions are given on page 54. 


For t > 15 the values of v are 
negative, so the car then 
moves back towards O. 
However. the model may be 
intended to apply only 
for0<t< 15. 


a7 
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Summary of Section 1 


This section has introduced: 
© differentiation (informally) as a process which enables you to find 
either the gradient of a graph, or the rate at which one variable 
changes with respect to another; 
© differentiation of a function f (formally) as the process of finding the 
derived function f’ and derivative f’(a), as defined by 
. (f(a +h) — f(x) 
Hn) — - 
UAC fim ( h ) , 
© the application of differentiation to quadratic functions and to power 
functions f(a) = x", where n is a real number. 


(1.3) 


Exercises for Section 1 


Exercise 1.1 


Use result (1.1) to find the gradient of the graph of y = 6 + 5a — 4x? at the 
point (2,0). 


Exercise 1.2 
(a) Show that if «#0 and 2+h 40, then 
1 1 h 


rth x ~(a+h)ax" 


(b) Use equation (1.3) and the result of part (a) to differentiate the 
function 
fey=4. 
z 
(c) Verify that the result of part (b) can also be obtained by applying 
result (1.4). 


Exercise 1.3 


An aeroplane accelerating along a runway before take-off has travelled a 
distance s = 3t? +t metres from the start of the runway after t seconds. 


(a) Find the corresponding velocity v (in ms~') of the aeroplane at time t. 
(b) What is the velocity of the aeroplane after 10 seconds? 
(c) At what time does the aeroplane have velocity 25 ms~'? 


2 Graphs and optimisation 


In Section 1 the process of differentiation was introduced. You saw how 
this process could be applied in certain cases to find formulas for the 
derivatives of functions, and so obtain values of the gradients at points on 
the corresponding graphs. 


A major application of differentiation is in finding an optimum value of a 
function, that is, a highest or lowest value. In a modelling context this 
may represent the ‘best’ way of achieving some practical outcome, subject 
to certain constraints. For example, it might correspond to a maximisation 
of profit or a minimisation of resource expenditure. 


Optimisation is discussed in Subsection 2.3. In preparation for this topic, 
Subsection 2.2 shows how differentiation can assist in understanding the 
structure of the graph of a given function, which is also useful in its own 
right. The examples here are based on simple polynomial functions. 
Subsection 2.1 shows how to find the derivatives of such functions. 


2.1 Differentiating polynomial functions 


A polynomial function of degree n is a function whose rule has the form 
(2.1) 


where a,, # 0, This includes the cases of linear functions (n = 1), quadratic 
functions (n = 2) and cubic functions (n = 3). For each of the first two of 
these cases, you saw in Section 1 how the corresponding derivative is 
obtained. Thus 

f(x) = a,x + ag has derivative f'(x) = a; 
and 

f(a) = agx + ax + ao has derivative f’(x) = 2ar + ay. 


f(x) = ana” + a,-y2"~! +++ + aQ07 + a,x + ao, 


For the general polynomial function given by equation (2.1), the derivative 
is 


f'(z) = naa"! + (n — Vag_yr"? + +--+ 2097 +a). 


This result can be written concisely, using sigma notation, as 


if f(z) = Car’, then f'(x) = Sia’. (2.2) 
i=0 i=1 
Note. from result (1.4) for the derivative of x", that result (2.2) is 
equivalent to the statement 
if f(x) = > aa’, then f(a) =} a;(derivative of x), (2.3) 
=0 i=l 


which is an alternative way to remember the result. 


Polynomial functions were 
introduced in Chapter A3, 
Section 5. 


The line y = a,x + ao has 

gradient a). The derivative of 
a quadratic function was 
given, with different labels for 
the coefficients, in result (1.1). 


Sigma notation was 
introduced in Chapter B1, 
Subsection 1 


The lower limit in the 
summation for f’(2) in 
result (2.2) can be written as 
i =1 rather than i = 0, since 
the value of the term iajr!~! 
for i = 0 is zero. 
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Comparing 

f(x) = 22° — 9a* + 3a with 
equation (2.1), we have n = 6, 
a; = 3, a4 = —9, ag = 2 and 


ag = 2 = a3 = a5 = 0. 


A polynomial function of 
degree 4 is called a quartic 
function, 
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Example 2.1 Differentiating a polynomial 
Differentiate the polynomial function f(x) = 22° — 9x‘ + 3x. 


Solution 
From result (1.4), the derivative of «° is 6x°, the derivative of x‘ is 4° and 
the derivative of x (= 2") is 1 (= x°). From result (2.3), the derivative of 
the given polynomial function is 
f'() = 2(derivative of x°) — 9(derivative of x*) + 3(derivative of x) 
= 2(6x°) — 9(42°) + 3(1) 
= 122° — 362° + 3. 


‘Activity 2.1. Differentiating polynomials 


(a) Differentiate the polynomial function f(x) = 6a* — 32°. 


(b) What is the gradient of the graph of y = —x* + 5x? — 7x + 15 at the 
point (1,12)? 


Solutions are given on page 54. 


2.2 Features of graphs 


In this subsection you will meet methods for identifying various key 
features of the graphs of functions. The examples involve polynomial 
functions, for which the derivative formula was given in the previous 
subsection. However, the degree of the polynomial function in each case 
will be no higher than 4. 


Several useful results will be stated. Note that these results apply to any 
smooth function, and not just to the types of polynomial function 
considered in the examples. 


We look first at the graph of the cubic function 
f(x) = 2r° + 32? — 122 — 4, 
which is shown in Figure 2.1. 


y=f(z) 


e 
av 


20-4 


Figure 2.1. Graph of y = 2x3 +32? — 12 —4 


The labels in Figure 2.1 refer to the following important features of this 
graph. 
(a) the graph crosses the x-axis at three points, approximately at 
2 =-—3.2, c= —0.3 and x = 2.0; 
(b) the graph crosses the y-axis at one point, where y = —4; 
(c) the graph has a ‘peak’ at approximately (—2, 16); 
(d) the graph has a ‘trough’ at approximately (1,—11); 
(e) to the left of the peak, the graph rises; 
(f) between the peak and the trough, the graph falls; 
(g) to the right of the trough, the graph rises. 
We now consider each of features (a)~(d), in turn. 
The «-values where the graph crosses the x-axis are the solutions of the 
equation 
f(z) =0; 2a + 327 — 127 -4=0. 


This is a cubic equation, and in general such equations cannot be solved 
straightforwardly by hand. 


that is, 


The y-value where the graph crosses the y-axis is f(0) = —4, which is the 
constant term in the polynomial. 


A peak such as the one near (—2, 16) in Figure 2.1 is called a local 
maximum of the function because it gives the greatest function value in 
the immediate vicinity of the point (the word ‘local’ is needed because the 
function may take greater values elsewhere). Similarly, the trough on the 
graph near (1,—11) is called a local minimum. 


Note that this and some other 
graphs in this chapter are 
drawn with unequal scales on 
the two axes. 


In (e)-(g), and in similar 
statements throughout this 
subsection, it is assumed that 
the graph is traversed from 
left to right. 


However, in this particular 
case, it turns out that #« = 2 
is a solution, from which the 
other two solutions can also 
be found if needed. 
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The plural of ‘maximum’ is 
‘maxima’, and the plural of 
‘minimum’ is ‘minima’. 


As you will see, not all 
stationary points are local 
maxima or minima. 


More precisely, there is a 
local maximum at «= 2 if 
there is some open interval /, 
containing ao, such that 

f(x) < f(xo) for any « 4 xo 
in J. A similar statement can 


be made for a local minimum, 


with the inequality replaced 
by f(x) > f(o). 
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In general, at local maxima and minima, the gradient is zero. In other 
words, the tangent at a local maximum or minimum is horizontal, as 
shown in Figure 2.2. 


y= f(z) 


ay 


local 
minimum 


Figure 2.2 Horizontal tangents at local maximum 
and minimum 


Thus the local maxima and minima can be found by identifying those 


values of « for which f’(a) = 0, which are known as the stationary points 


of the function f. 


Stationary point, local maximum and local minimum 

Let f be a smooth function. 

© The function f has a stationary point at x = xp if f'(xo) = 0. 
The corresponding point (xp, f(xo)) on the graph of f is also 
called a stationary point. 


© The function f has a local maximum at a stationary point 
x = 2 if f(x) is greater than any other function value in the 
immediate vicinity of xo. The corresponding point (xo, f(xo)) on 
the graph of f is also called a local maximum. 

© The function f has a local minimum at a stationary point 
@ = Xp if f(xo) is less than any other function value in the 
immediate vicinity of xo. The corresponding point (ao, f(zo)) on 
the graph of f is also called a local minimum. 


SECTION 2 GRAPHS AND OPTIMISATION 


Example 2.2 Finding stationary points 
(a) Differentiate the function 
f(a) = 20° + 32? — 122 — 4, 
whose graph was given in Figure 2.1. 
(b) By solving the equation f’(x) = 0, find the stationary points of f. 
Solution 
(a) From results (1.4) and (2.3), the derivative of the given polynomial 
function is 
f(x) = 2(32?) + 3(2r) — 12(1) 
= 6a? + 6x — 12 
= 6(2? +2 —2). 
(b) The equation f'(x) = 0 is equivalent to 
(x +2)(x - 1) =0. 
This equation has solutions « = —2 and x = 1. Thus the stationary 
points of f are at « = —2 and «= 1, 


x’ +x—-2=0; that is, 


Comment 


By substituting the values 2 = —2 and x = 1 in turn into the rule for f, we 
find that f(—2) = 16 and f(1) = —11. In fact, the two stationary points 

correspond to the local maximum (—2, 16) and the local minimum (1, —11). 
This confirms the values given for features (c) and (d) following Figure 2.1. 


Activity 2.2 Finding stationary points 


(a) Differentiate the function 
f(x) = 2° — 6x? — 15a + 54. 
(b) By solving the equation f(x) = 0, find the stationary points of f. 


Solutions are given on page 55. 


Next we discuss the three features (e). (f) and (g) of the graph in 
Figure 2.1. These relate to the parts of the graph which rise or fall. To 
describe these features more formally, it is convenient to extend the 
concept of an increasing (or decreasing) function, as follows. 


Suppose that f is a real function and that the interval J is in the domain 
of f. Then f is said to be increasing on J if 


for all a,22 in I, if z; <x, then f(x) < f(r»). 
and f is said to be decreasing on J if 
for all 2,22 in I, ifr; <2, then f(a) > f(x2). 


For example, the function f(a) = x? is increasing on the interval (0.00) 
and decreasing on the interval (—oo,()): see Figure 2.3 overleaf. 


Note that, in order to answer 
this type of question, it 
suffices to find the 
x-coordinate of each 
stationary point, unless you 
are explicitly asked to 
calculate also the 
corresponding function value. 


The concepts of ‘increasing” 
and ‘decreasing’ were 
introduced in Chapter A3, 
Subsection 4.1, where they 
applied to the whole domain 
of a function. 
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Note that in this criterion J is 
an open interval. 


In this display the common 
practice of writing ‘for x in 
-..* to mean ‘for all a in...” 
has been used. This practice 
will be followed except in 
statements of definitions and 
results. 
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decreasing 0 increasing r 
on (20,0) on (0,00) 


Figure 2.3 Intervals on which f(x) = x? is 
increasing/ decreasing 


We can often determine intervals on which a function f is increasing or 
decreasing by using differentiation. This is so because the derivative f’(«) 
gives the gradient of the graph at the point (2, f(x)), and the sign of the 
gradient is related to whether the function is increasing or decreasing. 


Increasing/Decreasing Criterion 
Let J be an open interval in the domain of a smooth function f. 


© If f’(x) > 0 for all x in J, then f is increasing on J. 
© If f(x) <0 for all x in J, then f is decreasing on J. 


For example, consider the function f(a) = xc? whose graph is shown in 
Figure 2.3. In this case 
eros f'(x) >0 for x in (0,00), 
BD ai oy <0 for z in (—oo, 0). 
This corresponds to the increasing and decreasing properties of f shown in 
Figure 2.3, in accordance with the Increasing/Decreasing Criterion. 


SECTION 2 GRAPHS AND OPTIMISATION 


Example 2.3 Finding where a function is increasing/decreasing 
In Example 2.2 you saw that the function f(x) = 2x* + 32? — 12x — 4 has 
derivative 

f'(w) = 6(a? + 2 — 2) = 6(a + 2)(2 — 1). 
(a) Show that 


f(x) >0 for x in (—o0, —2), 
{re} <0 for x in (—2,1), 
f(z) >0 for x in (1,00). 


(b) Use the Increasing/Decreasing Criterion to deduce from part (a) 
intervals on which f is increasing or decreasing. % 


Solution 
(a) For x < —2. we have 
2+2<0 and r-1<-3<0, 


so f'(x) = 6(x + 2)(a—1) > 0 (since f'(x) is the product of 6 and of 
two negative numbers). 


For —2 < « < 1, we have 
@+2>0 and r-1<0, 


so f'(x) < 0 (since f’(2) is the product of 6, one other positive number 
and one negative number). 


For x > 1, we have 
@+2>3>0 and x-1>0, 


so f'(x) = 6(a + 2)(x — 1) > 0 (since f’(x) is the product of 6 and of 
two other positive numbers). 


(b 


By the Increasing/Decreasing Criterion, f is increasing on (—o0, —2) 
and on (1,00), but decreasing on (—2, 1). 


Activity 2.3 Finding where a function is increasing/decreasing 
In Activity 2.2 you showed that the function f (2) = 2° — 6a? — 15a + 54 
has derivative 

f' (a) = 8a? — 122 — 15 = 3(a + 1)(a — 5). 
(a) Show that 


f(z) >0 for x in (—c0,—-1), 
f(z) <0 for x in (—1,5), 
f(z) >0 for x in (5,00). 
(b) Use the Increasing/Decreasing Criterion to deduce from part (a) 
intervals on which f is increasing or decreasing. 


Solutions are given on page 55. 
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Figure 2.5 Graph of y = «* 


The connection between derivatives and intervals on which a function is 
increasing or decreasing can be used to provide a useful test for classifying 
a stationary point as a local maximum or a local minimum. This is stated 
in the box below and illustrated in Figures 2.4 and 2.5. 


First Derivative Test 

Suppose that zo is a stationary point of a smooth function f; that is, 

f'(xo) = 0. 

© If f’(x) changes sign from positive to negative as x increases 
through 29, then f has a local maximum at 29. 

© If f’(x) changes sign from negative to positive as x increases 
through 2, then f has a local minimum at zo. 

© If f’(x) does not change sign as x increases through xo, then f 
has neither a local maximum nor a local minimum at xp. 


(a) (b) 
Figure 2.4 (a) Local maximum  (b) Local minimum 


In the case of a local maximum, the function is increasing on an interval to 
the left of zy and decreasing on an interval to the right of x. The reverse 
is true for a local minimum. 


Note, however, that a stationary point (at which f/(x) = 0) need not give 
rise to either a local maximum or local minimum, as demonstrated by the 
example of f (2) = a, whose graph is shown in Figure 2.5. This function 
has a stationary point at 2 = 0, which is neither a local maximum nor a 
local minimum. In this case the function is increasing on all intervals to 
the left or right of z = 0. This is confirmed by the sign of the derivative 
f'(x) = 32x?, which is positive wherever x # 0. 


To determine whether a general function f is increasing or decreasing to 
the left or right of a stationary point 29. it is usually sufficient to check the 
sign of the derivative f/(a) at two points, « = ,, to the left of 9 and 

© = xp to the right of xo, as specified in the following strategy. The First 
Derivative Test can then be applied in the form given in the strategy. 


Strategy 
To identify the local maxima and local minima of a smooth 
function f. 
1. Find the stationary points of f. 
2. For each stationary point x9, 
(a) choose points x, to the left of xp and rp to the right of xo, 


such that the interval [x,, rp] lies in the domain of f and In this chapter we apply this 

there are no stationary points between z;, and xo, nor between strategy only to polynomial 

2% and xp, and then calculate f’(x,) and f'(rR): functions. Hence the interval 
[e1,2R] always lies in the 


(b) classify 2p as follows: 
© if f(x.) > 0 and f’(rp) <0, then f has a local maximum 
at Zo: 
© if f(x.) <0 and f'(x_) > 0, then f has a local minimum 
at Zo; 
© if f'(x,) and f’(x_) have the same sign, then f has 
neither a local maximum nor a local minimum at 29. 


domain of f. 


Remarks 

1. For each stationary point xo, the points x, and xg described in 
Step 2(a) are referred to as ‘test points’. 

2. The choice for a pair of test points is not unique. We usually choose 
test points near 2p that facilitate the calculation of the derivatives in 
Step 2(a). 


Before applying the complete strategy, here is some practice in choosing 
test points. 


In Example 2.2 it was found that the function 
f(x) = 2a° + 3a? — 122-4 


has stationary points at « = —2 and x = 1. Determine a pair of test points 
for each of the stationary points. 


A solution is given on page 55. 


The complete strategy is illustrated in the following example. 


Use the strategy to apply the First Derivative Test to classify each of the Here ‘classify’ means ‘identify 
stationary points of the function as a local maximum, a local 
fc) = 2° —327 +1, minimum, or neither’. 


There are no stationary 
points between —1 and 0, nor 
between 0 and 1. 


The notation f” is 
pronounced as ‘f double dash’ 
or ‘f double prime’. 


Third, fourth and 
higher-order derived functions 
and derivatives may be 
defined in a similar way. 
When more than one order of 
a derivative is involved, f’ (2) 
may be called the first 
derivative (as in First 
Derivative Test). 
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Solution 
Step 1: The derivative of f(x) = x* — 327 +1 is 
f(x) = 3a? — 6x = 3x(x — 2). 
The equation 3a(2 — 2) = 0 has solutions x = 0 and x = 2, so the 


stationary points of f are at r = 0 and x = 2. (These are the only 
stationary points of f.) 


To classify the stationary point at x = 0, we choose test points 
. = —l and rp = 1, say. Then we have 
far) = f'(-1) = 3(-1)(-3) = 9 > 0, 
f'(xr) = f'(1) = 3(1)(-1) = -3 < 0, 
so f has a local maximum at x = 0, by the First Derivative Test. 


To classify the stationary point at z = 2, we choose test points x, = 1 and 
Zp = 3, say. Then we have 


f'(ar) = f'(l) = -3 <0, 
f'(@r) = f'(3) = 3(3)(1) = 9 > 0, 


so f has a local minimum at «x = 2, by the First Derivative Test. 


Activity 2.5 Applying the First Derivative Test 
Use the strategy to apply the First Derivative Test to classify each of the 
stationary points of the function 
f(x) = 324 — 22° — 92? +7, 
noting that the derivative has a factor 2. 


A solution is given on page 55. 


There is another test that is sometimes useful for the purpose of classifying 
the stationary points of a function. This involves the second derivative of 
the function. 


Second derivatives 


When we differentiate a smooth function f, we obtain the derived 
function f’. Often this also is a smooth function, which can itself be 
differentiated. The result of doing so is the derived function of f’, which is 
denoted by f” and called the second derived function of the original 
function f. The value of f” at a given point is called the second 
derivative of f at that point. (Often the term ‘second derivative’ is also 
used to refer to the second derived function f”.) For example, if f(x) = x”, 
then 


File\=22 so 742) =2; 
Since the derived function of any non-constant polynomial function is 


another polynomial function, results (1.4) and (2.3) can be used as before 
to find the second derivative of a polynomial function. 


SECTION 2 GRAPHS AND OPTIMISATION 


Activity 2.6 Finding a second derivative 


You showed in Activity 2.1(b) that the function 
f(v) = —2* +52? —7x+15 

has derivative 
f'(x) = —3x* + 10x — 7. 

Find the second derivative of the function f. 


A solution is given on page 55. 


Second derivatives, like first derivatives, frequently crop up in 
mathematical models. For example, you saw in Subsection 1.3 that if 

s = f(t) (in metres) is the position of a car moving along a straight road, 
measured from a fixed starting point, where t (in seconds) is the time since 
leaving the starting point, then the velocity of the car is represented by 
v= f'(t), in ms~'. A further differentiation gives the acceleration of the 
car, a= f"(t), measured in metres per second per second (ms~*). 


The second derivative is used in a further test for classifying the stationary 
points of a function. To show briefly why this works, suppose that zo is a 
stationary point of a function f, so that f’(o) = 0, and suppose also that 
f"(ao) > 0. Then there will be some open interval J, with xo in J, such 
that f”(x) > 0 for x in J. It follows from the Increasing/Decreasing 
Criterion, applied to the function f’ rather than to f as originally stated, 
that f’ is increasing on J. This means, since f'(2) = 0, that f’ changes 
sign from negative to positive as x increases through xp. Thus zo is a local 
minimum, by the First Derivative Test. A similar argument shows that if 
f' (zo) < 0 at a stationary point xo. then zo is a local maximum. 


Second Derivative Test 

Suppose that zo is a stationary point of a smooth function f; that is, 
f'(xo) = 0. 

© If f"(«o) <0, then f has a local maximum at xo. 

© If f"(xo) > 0, then f has a local minimum at x. 


One way of remembering the details of this test is indicated by the two 
‘faces’ shown in Figure 2.6. The depiction of the ‘nose’ on each face, by 
resembling the double prime notation in f”, is a reminder that the second 
derivative is involved. The ‘mouth’ indicates the shape of the graph close 
to the stationary point xo, while the ‘eyes’ give the corresponding sign 

of f"(o). 


The acceleration is the rate of 
change of velocity v with 
respect to time ¢. 


It is not essential to follow 
the details in this paragraph. 
If you are short of time, then 
move straight to the boxed 
statement below, 


Note that this test gives no 
information if f”(a) = 0. 


Figure 2.6 ‘Faces’ to 
recall the Second 
Derivative Test 
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Figure 2.7 Graph of 
fa) =a" 


In Example 2.4 it was found that the function 
f(w) =a° —327 +1 


has derivative 
f'(x) = 32" — 6x = 32(x — 2) 


and hence has stationary points at « = 0 and « = 2. Use the Second 
Derivative Test to classify each of these stationary points. 


Solution 
The second derivative of f is obtained by differentiating f(x), to obtain 
f" (x) = 6x — 6 = 6(x — 1). 


For the stationary point at x = 0, we have f”(0) = —6 < 0, so f has a local 
maximum at x = 0, by the Second Derivative Test. 


For the stationary point at z = 2, we have f”(2) = 6 > 0, so f has a local 
minimum at x = 2, by the Second Derivative Test. 


Comment 


As expected, these classification results agree with those found by applying 
the First Derivative Test in Example 2.4. 


In Activity 2.5 you found that the function 
f(a) = 324 — 225 — 927 +7 


has derivative 
f'(x) = 1224 — 6x? — 18x = 6r(2x — 3)(x + 1) 


and hence has stationary points at z= —1, x = 0 and x = $. Use the 
Second Derivative Test to classify each of these stationary points. 


A solution is given on page 55. 


The solutions to Example 2.5 and Activity 2.7 suggest that it is easier to 
use the Second Derivative Test than the First Derivative Test when 
classifying stationary points, despite the additional differentiation required. 
Indeed, it is frequently easier to use the Second Derivative Test for 
polynomial functions, but you should be aware that for certain functions 
this test cannot be applied. For example, the function f(x) = xt, whose 
graph is shown in Figure 2.7, has a stationary point at « = 0 which is a 
local minimum. However, in this case f’(x) = 4x*, so f(x) = 12x? and 
hence f”(0) = 0. This means that the Second Derivative Test cannot be 
applied. On the other hand, the First Derivative Test can be applied. In 
the notation of the strategy (page 27), if 2, = —1 and rz = 1, say, we have 


f(a.) = f'(-1) =-4<0 and f'(rp) = f'(1) =4>0, 


confirming the presence of a local minimum. For this reason, it is 
important to be able to use both tests. 


Sketching the graph 

Once the stationary points of a function have been classified, it is often 
possible to produce a reasonable sketch of the graph of the function. For 
example, the function 


Fila)i= a —327+1 The stationary points of this 
ei é. function were found and 
has a local maximum at x = 0 and a local minimum at « = 2. Since we classified in Examples 2:4 


have f(0) =1 and f(2) = —3, the shape of the graph near the points (0,1) anq 2.5. 
and (2,—3) can be indicated as in Figure 2.8(a). Because there are no 

other stationary points, f must be increasing on (—oo,0), decreasing on 

(0,2) and increasing on (2,00), so we can complete a rough sketch of the 

graph as shown in Figure 2.8(b). Note that for such a rough sketch it is 

not necessary to calculate or estimate the values of the x-intercepts, but 

the sketch should show the y-intercept. : 


ya 
local 
maximum : 

(0,1) Here the y-intercept happens 
to correspond to a stationary 
point. 

0 r 


(a) (b) 


P Sad 


Figure 2.8 Sketching the graph of y = 2° 


Though rough, this sketch gives a good impression of the overall shape of 
the graph. In the next activity, you are invited to produce a similar rough 
sketch. 


The stationary points of the function 
f(z) = 324 — 22° — 9x? +7, 


were found and classified in Activities 2.5 and 2.7. They are at = —1 
(local minimum), « = 0 (local maximum) and x = $ (local minimum). 
Find the y-coordinates of each of these points on the graph, and hence 
draw a rough sketch of the graph of this function. 


A solution is given on page 56. 
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All the functions considered as examples in this section are polynomial 
functions. As was pointed out earlier, all the boxed results in this 
subsection apply more generally, to any smooth function f. The First and 
Second Derivative Tests can be used to classify the stationary points of 
many non-polynomial functions, and the classification can be used to help 
sketch graphs of the functions. However, this only becomes a practical 
proposition once you know how to differentiate non-polynomial functions. 
Sections 3 and 4 will widen considerably the range of functions that you 
can differentiate. 


2.3 Optimisation 


In many modelling situations, it is required to find the greatest or least 
value attained by a function f as the independent variable ranges over a 
particular interval (arising from constraints within the model). Two such 
situations are illustrated in Figure 2.9, where the interval is J = {a, b}. 


(a) 


Figure 2.9 Greatest and least values 


In Figure 2.9(a), the greatest value of f is attained at B and the least 
value is attained at A. Both A and B correspond to endpoints of the 
interval. In Figure 2.9(b), however, the greatest value of f is attained at an 
endpoint (B again) but the least value is attained at D, a local minimum. 


The greatest or least values attained by a function on an interval are called 
the optimum values on that interval, and the process of finding these 
values is called optimisation. Since optimum values can occur only at 
endpoints or at local maxima or minima, the following procedure can be 
relied on to find the optimum values and where they occur. 


SECTION 2 GRAPHS AND OPTIMISATION 


Optimisation Procedure 

To find the greatest (or least) value of a smooth function f on a 

closed interval J within the domain of f, proceed as follows. 

1. Find the stationary points of f. 

2. Evaluate f at each of the endpoints of J and at each of the 
stationary points inside J. 

3. Choose the greatest (or least) of the function values found in 
Step 2. 


The following example illustrates the use of this procedure. 


Example 2.6 Finding optimum values 


Find the greatest and least values of the function 
f(x) = —2a° + 32? + 72x 
on the interval J = [—5, 5). 
Solution 
We follow the steps of the Optimisation Procedure. 
Step 1: The derivative of the given function is 
f(x) = —6x? + 6x + 72 = —6(x? — x — 12) = -6(x + 3)(x — 4), 
so the stationary points are at «= —3 and x= 4. 


Step 2: At the endpoints of the interval / = |—5,5], the function values 
are 


f(—5) = 250 + 75 — 360 =—-35 and f(5) = —250+ 75 + 360 = 185, 


while at the stationary points (which are both inside /) the function values 
are 


f(-3) =54+27-216=-135 and f(4) = —128 + 48 + 288 = 208. 


Step 3: The greatest value of f(x) on [—5,5] is 208 (at x = 4), and the 
least value of f(x) on [—5, 5] is —135 (at 2 = —3). 


Activity 2.9 Finding optimum values 


In each of the following cases, find the greatest and least values of the 
function f on the given interval J. 


(a) f(z) =40%-a? 8241 IT =[-3,5] 
(b) f(z) =32?-2r+5 I=[-1,4] 
Solutions are given on page 56. 


Note that it is not essential 
here to classify the stationary 
points. 


If I had been [—2,3], then 
both stationary points found 
in Step 1 would be outside J. 
In this case, the greatest and 
least values of f(a) on J are 
f(3) = 189 and 

f(—2) = —116, respectively. 
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Summary of Section 2 


This section has introduced: 
© the formula for obtaining the derivative of a polynomial function; 


© the stationary points of a function f, being the points where f’(:c) = 0, 
including the cases of a local maximum and a local minimum; 


© the Increasing/Decreasing Criterion, which uses the sign of f'(x) to 
determine whether f is increasing or decreasing on an interval: 


© the First Derivative Test, and an associated strategy, for classifying 
each stationary point of a smooth function as a local maximum, a 
local minimum, or neither of these; 


© the Second Derivative Test for classifying each stationary point of a 
smooth function (where applicable) as a local maximum or a local 
minimum; 


© use of the location and classification of stationary points in sketching 
the graph of a smooth function; 


© the Optimisation Procedure for finding the optimum (greatest or least) 
value of a smooth function on a closed interval. 


Exercises for Section 2 


Exercise 2.1 

(a) Differentiate the polynomial function f(a) = 7a?! — 13a°° + x. 

(b) What is the gradient of the graph of y = 5a* + 4x* — 9x —8 at the 
point (—1,0)? 

Exercise 2.2 

(a) Find the stationary points of the function 

f(a) = —a* — 32? + 97 +5. 

(b) Classify each of the stationary points found in part (a), 
(i) using the strategy to apply the First Derivative Test; 
(ii) using the Second Derivative Test. 

(c) Find the y-coordinates of each of the stationary points on the graph, 
and evaluate (0). Hence draw a rough sketch of the graph of this 
function. 

Exercise 2.3 

In Activity 2.2 you found that the function 
f (x)= 2° — 6x? — 152 +54 

has derivative 
f'(z) = 3x? — 122 — 15 = 3(x + 1)(a — 5) 

and hence has stationary points at « = —1 and x = 5. 


(a) Use the Second Derivative Test to classify each of these stationary 
points. 


(b) Find the greatest and least values of the function f on each of the 
following intervals I. 


(i) T=[0,7] Gi) T=[-4,7] 


3 Differentiating further functions 


To study the latter part of Subsection 3.3 you will need a video player and 
the Video Tape. 


In this section, the main aim is to extend the range of functions which you 
can differentiate. In Subsection 3.1 the derivatives of sin x, cos, e” and 
In are introduced, with some informal justification. These results are 
then tabulated, together with that for the derivative of 2” which you saw 
in Subsection 1.3. 


Subsection 3.2 states two straightforward rules that make it possible to 
differentiate various combinations of these basic functions. Subsection 3.3 
introduces the alternative Leibniz notation for derivatives, which features 
in the Video Tape. 


3.1 Basic derivatives 


We look in turn at several basic functions, reviewing first a case that you 
have already seen. 


Power functions and constant functions 


It was stated in Subsection 1.3 that the derivative of any power function is 
given by the result 


if f(z) =a", then f'(«) =nx", (1.4) 


where 7 is any real number. You saw this formula for f’(x) justified in 
several individual cases (n = 0, 1, 2,3, 4). 


In the case n = 0, the function is simply f(a) = 1, and the derivative 

f'(x) = 0 corresponds to the fact that the graph of y = 1 is a horizontal 
line, which has gradient 0 at all points. It is clear that the same applies for 
any other horizontal line, whose equation has the form y = c where c is a 
constant. Any function f(a) = c, where c is a constant, is called a 
constant function. Thus if f is any constant function, then its derivative 
is zero: 


if f(z) = ¢ (c constant), then f’(x) = 0. (3.1) 


Sine and cosine functions 


There now follows an attempt to make plausible the formulas for the 
derivatives of the trigonometric functions sin and cos, without resorting to 
a formal justification. 


The approach will be the opposite of that employed in Subsection 2.2. 
There information about the values or sign of f’(x), obtained from its rule, 
was used to deduce the behaviour of the graph of f(x). Here, the graph of 
f(z) is used to sketch the graph of f’(x), which suggests what the rule for 
f'(x) might be. We start with the graph of f() = sin, as shown in 
Figure 3.1(a). overleaf. 


These functions were all 
discussed in Chapter A3. 


While the following text 
should give you some idea of 
how the derivative formulas 
for sin and cos 
essential to follow it in order 
to apply these results. If you 
are short of time, you may 
prefer to note results (3.2) 
and (3.3) below, and then 
read on from that point. 
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Note that, in this graph and 
throughout the following 
discussion, « is in radians. 


Remember that your: 
calculator needs to be in 
radian mode for this 
calculation. 
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CHAPTER C1 DIFFERENTIATION AND MODELLING 


Figure 3.1. (a) Graph of y=sinx (b) Graph of y = sin on (0, 2z] 


The graph of sin is periodic, with period 27, so the same will be true of its 
derivative, which gives the gradient at each point of the graph. It suffices 
therefore to consider the portion of the graph in the interval [0, 27], as 
shown in Figure 3.1(b). Within this interval, sin has stationary points 
(zero gradient on the graph) at « = 47 and x = 37, so we must have 


f'n) = f'(Gx) = 0. 
The graph of sin has positive gradient on the intervals (0, $7) and ($7, 27), 
and negative gradient on the interval (47, 37), so 

f'(x) > 0 for x in (0, 77) and for x in ($7, 27), 

f'(x) <0 for x in ($7, $7). 
This assists in sketching the graph of f’(x), since this graph is above the 
x-axis where f’(z) > 0 and below the z-axis where f'(x) < 0. By looking 


at the gradient in Figure 3.1(b), we see that the derived function f" is 
decreasing on the interval (0,7) and increasing on the interval (7,27). 


Finally, we ask what are the greatest and least values that the gradient of 
sin can take within the interval [0,27]. The greatest value occurs where 
the graph is at its steepest increase, which seems to be at x = 0 and 

aw = 27. The least value occurs where the graph is at its steepest decrease, 
which seems to be at z = 7. The symmetry of the graph suggests that the 
gradient at a = 7 is the negative of the gradient at x = 0. 


The following activity indicates that the gradient at x = 0 of the graph of 
sin is close to 1. 


Activity 3.1 Calculating the gradient of sin at x=0 
According to equation (1.3), the derivative of f(x) = sinx at 2 =0 is 


petit at) _ (sinh 
110) = jim (=) = tn (FD: 
Calculate the values of the ratio (sinh)/h for each of h = 0.1, 0.01,0.001, 
giving your answers to six decimal places. To what limit does the ratio 
seem to tend as h — 0? 


A solution is given on page 56, 


The result of Activity 3.1 and the text beforehand suggest that if 
f(x) =sinz, then 


f(O)=1, f()=-1, f'n) 


SECTION 3 DIFFERENTIATING FURTHER FUNCTIONS 


Adding this information to that obtained previously, we can sketch the 
graph of f’(x) as shown in Figure 3.2. 


Figure 3.2 Sketch graph of y = f'(x), where 


J (x) =sine 


This looks like the graph of the cosine function! In fact, the derivative of 


sin is indeed cos; that is, 


if f(z) =sinw, then f’(«) = cosx. 


We could undertake a similar argument to make plausible the formula for 
the derivative of f(a) = cos, but there is a quicker approach, based on 


result (3.2). 


The graph of cos (derivative of sin) is shown in Figure 3.2. This graph is 
the same shape as that of sin, but translated by 47 to the left. Hence the 
graph of the derivative (giving the gradient) of f(a) = cosr has the same 
shape once again, but translated a further 47 to the left. However, the 
outcome of translating the graph of cos x in this way is the graph 

of —sinz. This explains the result that 


if f(x) =cosx, then f'(x) = —sinz. 


Exponential and logarithmic functions 


First recall the shape of graphs of the form y = a* (a constant), some 
instances of which are shown in Figure 3.3. 


Figure 3.3 Graphs of y = a* for a= },1,2,3 


Note that result (3.2) holds 
only when 2 is in radians. 
The same applies to 

result (3.3) below. 


Translating the graph of cos. 
to the left by 37 gives the 

graph of cos(a + 47), as 
explained in Chapter A3. 
Using trigonometric identities 
from Chapter A2, we have 

cos(a + $7) s(4a — (—a)) 
= sin(—2) = 


As with the trigonometric 
functions earlier, it is not 
essential to follow the 
explanatory details here of 
how the derivative formulas 
for exp and In arise. If you 
are short of time, you may 
prefer to note results (3.5) 
and (3.6) below, and then 
read on from that point. 


Exponential functions were 
introduced in Chapter A3, 
Subsection 3.2. 
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The value of e stored by your 
calculator can be found by 
evaluating e! = exp 1. 


Each of the graphs in Figure 3.3 passes through the point (0,1), since 

a°® = 1 whenever a # 0. It seems that the greater the value of a, the larger 
is the gradient of the graph of y = a* at (0,1). For y = 2”, the gradient at 
(0.1) appears to be less than 1, whereas for y = 3” the gradient at this 
point appears to be greater than 1. This can be seen by comparison with 
the broken line through (0,1) which represents y = x + 1 and has 

gradient 1. The number e = 2.718... has the property that the gradient of 
the graph of y = e* at the point (0, 1) is exactly 1, as illustrated in 

Figure 3.4. 


Figure 3.4 Graph of y = e* 


This defining feature of the number e was stated in Chapter A3. In terms 
of derivatives, it means that if f(x) = e*, then f’(0) = 1. According to 
equation (1.3), this is the same as saying that 


PON (sali . (eal 
lim (“=) =1; that is, 1m ( 7 ) —) (3.4) 
The next activity asks you to check that this is the case. 


Calculate the values of the ratio (e” — 1)/h for each of 
h = 0.1, 0.001, 0.00001, giving your answers to six decimal places. To what 
limit does the ratio seem to tend as h — 0? 


A solution is given on page 56. 


It is now straightforward to find the derivative of f(a) =e” from 
definition (1.3). Thus we have 
F(a+h)— f(x) _ et**-e* 
h = h 


teh 
pep os (by a rule for powers) 


On taking the limit as h — 0, we obtain 


fa) = jim (L249) = 112) 


=i eM -1 
semen RTS 


=e" (by equation (3.4)). 


In other words, 
if f(a) =e", then f'(r) =e". (3.5) 


This result states that the derivative of the exponential function is the 
exponential function! 


The following illuminating geometrical interpretation of this result leads on 
to the formula for the derivative of the function Inv. Suppose that P(x, y) 
is any point on the graph of y = e*, and that f is the tangent to the curve 
at P; see Figure 3.5(a). By result (3.5), the line ¢ has gradient e”, which is 
also the ratio ‘rise over run’ for the triangle PQR that has its base QR on 
the a-axis. Since the ‘rise’ RP is y = e”, the ‘run’ QR must be 1. In other 
words, any tangent at (a,y) to the graph of y = e* intersects the x-axis 
one unit to the left, at (2 — 1,0). 


(a) (b) 


Figure 3.5 (a) Tangent £ to graph of y=e" (b) Corresponding tangent ¢’ to 
graph of y= Ing 


This geometrical result helps us to deduce the derivative of the natural 
logarithm function In which, as you may recall, is the inverse of the 
exponential function e”. The graph of an inverse function is obtained from 
that of the original function by reflection in the 45° line, with 

equation y = x. 

Suppose then that the graph and lines in Figure 3.5(a) are reflected in the 
45° line, accompanied by a swapping of the labels 2 and y throughout, so 
we again have a horizontal .-axis and vertical y-axis. The outcome is 
shown in Figure 3.5(b), where P’, Q’, R’ are respectively the images of the 
points P, Q, R, and ¢" is the image of the line 4. The reflected graph has 
equation x = e” or, equivalently, y = Inx. 


The function In was 
introduced in Chapter A3, 
Subsection 4.3. Section 4 of 
that chapter also dealt more 
widely with inverse functions. 


Although you may refer to 
the Handbook for these 
derivatives, they crop up so 
regularly in calculus that it is 
probably worth trying to 
remember them. 


These rules are 
straightforward consequences 
of definition (1.3) of the 
derivative, though the proofs 
are not given here. 
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Now the line ¢’ is tangent to this graph at P’, and the gradient of f’, with 
reference to the triangle P’Q'R’, equals the ‘rise’ Q’R’ = 1 divided by the 
‘run’ R’P’ = x, that is, 1/a. We have therefore arrived at the following 
result: 
i 
if f(x) =Inz (x > 0), then f'(x) = = (x > 0). (3.6) 


The differentiation results encountered in this subsection are collected 
together in the table below. 


Table 3.1 
Function f(z) Derivative f’(x) 
c 0 
a” na") 
sinx cost 
cost —sing 
a e 
Ina (x >0) 1/x (a >0) 


3.2 Sum and Constant Multiple Rules 


The derivatives of some basic functions are given in Table 3.1. Sums and 
constant multiples of these basic functions can be differentiated using the 
following rules. 


Sum Rule 
If k is a function with rule of the form k(x) = f(x) + g(x), where f 
and g are smooth functions, then k is smooth and 


k(x) = f'(x) +9'(2). 


For example, if f(z) = 2? and g(x) = sin, then (from Table 3.1) 
J'(x) = 2x 


so the function k(x) = x? + sin has derivative 


and g'(x) =cosz. 


k'(x) = 2x + cosa. 


Constant Multiple Rule 


If k is a function with rule of the form k(x) = cf(x), where f is a 
smooth function and ¢ is a constant, then k is smooth and 


k(x) = cf" (x). 


For example, if f(x) = x? and ¢ = 5, then the function h(a) = 52? has 
derivative 


k(x) =5 x 2a = 102. 


These two rules are often used in combination. For example. if 
f(x) = 7x? —4sin x, then we can write immediately (using these rules) that 


f'(x) = 7(2x) + (—4)(cosx) = 142 — 4cosz. 


SECTION 3 DIFFERENTIATING FURTHER FUNCTIONS 


In fact, these two rules were in effect taken for granted earlier: 


© in Frame 4 of the audio tape, in giving the gradient formula for a See Subsection 1.2. 
general quadratic function, repeated in result (1.1); 


© in giving the formula for the derivative of any polynomial function, in See Subsection 2.1, 
the form of result (2.2) or (2.3). 


In future they will be used without comment. 


Activity 3.3 Using the Sum and Constant Multiple Rules 


Differentiate each of the following functions. 
(a) f(x) =a -2sine (b) g(x) = 3e” —4Inx 


(c) p(x) = i + 3cosx 
x 


Solutions are given on page 57. 


3.3 Other notations for derivatives 


In Subsection 1.3 you saw the definition of differentiation of a smooth 
function f as the process of finding the derived function f’ of f. There are, 
however, other notations in common use for describing derivatives. Which 
notation to use depends partly on the context. as you will see. 


The function notation f’ is the preferred notation of those who study 

calculus principally as a branch of mathematics. It arises from the view 

that the fundamental objects within calculus are functions rather than 

variables. When calculus was developed, however, in the seventeenth 

century, today’s notion of ‘function’ did not exist. Newton and Leibniz The name of Leibniz is 
were concerned with geometric problems (for example, finding tangents to pronounced ‘lie-b-nitz’. 
curves), so it was natural for them to work with the variables involved in 

the equations of curves. 


‘The notation invented by Leibniz can be used for any pair of related 
variables, and it makes explicit mention of these variables. For example, if 
the variables y and x are related by the equation 

y=2", 


then Leibniz would have spoken of ‘the derivative of y with respect to x 
and written this as 


wu = 2x. The derivative on the 
left-hand side may also be 
Here dy/dx has the same meaning as f(x), where f is the function written as dy/da. It is 
f(x) = 2?. Note that the expression dy/dx should be regarded as a pronounced as 
complete symbol since, despite its appearance, it is not a fraction in the ‘dee-y by dee-x’, or sometimes 


ordinary sense. In particular, the two ds which appear in it do not cancel. _just as ‘dee-y dee-x’. 
Nor do the subexpressions dy and dx have any individual meaning. 
When the derivative for a particular value of the independent variable is 
required, the Leibniz notation has to be adapted. For example, if 
f(x) =z? then we have f’(3) = 6. In Leibniz notation, with y = 2”, this 
value of the derivative at 2 = 3 would be expressed as 
os es 
dz\p3 
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The Greek letter 6 (delta) is 
placed in front of a variable 
to indicate a small change in 
that variable. Symbols such 
as 6x and dy are ‘complete 
symbols’, not products. 


This matches the definition of 
derivative in equation (1.3), 
with h replaced by dx and 
S(a+h) — f(x) replaced 

by dy. 


This is the situation which 
was considered in 
Example 1.2. 


See Subsection 2.2, in the 
text immediately after 
Activity 2.6. 


2 
The symbol es, is 


pronounced as ‘dee-two-s by 
dee-t-squared’. 
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Leibniz arrived at his notation by thinking of the derivative as a means of 
describing the ‘rate of change’ of a dependent variable as the independent 

variable changes. This may be visualised by starting from a point (a, y) on 
the graph of y = f(x), and then making a small change dz in x which leads 
to a small change dy in y; see Figure 3.6. 


a v= f(z) 


(x + dx, y + by) 


ay 


) z tie 
Figure 3.6 Small change x in « and 
corresponding small change dy in y 


The ratio dy/éz is the ‘average’ rate of change of y with respect to x over 
the interval [2,2 + dx]. Leibniz used the notation dy/d.r to indicate the 
limiting value to which the quotient 6y/éx approaches as dc becomes 
small; that is, 


dy = lim oy 

dx 6x0 bx" 
It is then natural to regard the derivative dy/d as the ‘instantaneous’ rate 
of change of y with respect to x. While dy/dzx is not itself a quotient, it is 
the limit of quotients of the form dy/éx as 6x — 0. 
As mentioned earlier, the Leibniz notation can be used whenever two 
variables are related. For example. the equation 


A=" 
relates the area A of a circle to its radius r. The derivative of A with 
respect to r is 

dA _ 

ai 
This represents the instantaneous rate of change of A with respect to r. 


nr. 


As another example, if a car moving along a straight road has position 
s = 30¢ — # metres from its starting point after t seconds, then its velocity 
v (in ms~') at time ¢ is given by 

ds 

ra 30 — 2t. 
For this example, a further differentiation with respect to t gives the 
acceleration a of the car, which we originally denoted by a = f”(t) (where 
s = f(t)). In Leibniz notation, this is written as 

Ps 


ts 


v= 


SECTION 3 DIFFERENTIATING FURTHER FUNCTIONS 


In general, if y = f(x), the function notation and Leibniz notation are 
matched by the statements 


If we have (say) s = 30t — #?, a variation of the Leibniz notation for the 
derivative is to write 


d ae 2 
a or sot - ©). 


The latter form enables us to write derivative results very concisely. For 
example, two of the derivatives from Table 3.1 can be expressed as 

d 

dx 
The Leibniz notation is often used in applied mathematics, science and 
engineering, where there tends to be greater emphasis on variables than on 
functions. The variables frequently represent quantities with a physical 
interpretation, which are related to one another within a mathematical 
model. 


(x")=nz""" and aes =-—singz. 
dx 


One other notation is worth mention here. In textbooks on motion, 
especially in physics and engineering, differentiation with respect to time t 
is often represented by a dot above the dependent variable, as follows: 

ds 

dt” 

This notation was devised by Newton, who used calculus to study motion. 
The corresponding second derivative is denoted by 

ds 

dt2* 


Activity 3.4 Using other notations 


Find each of the following derivatives, using the notation indicated. 


Apps 


dV eV 4 
(a) ms and 7 where V = 


(b) f’(5) and f"(3), where f(x) = 45Ina 


(c) 2 Ie where s = 10 — 5t? 


(d) @ and ti, where u = 3cost 
Solutions are given on page 57. 


This section concludes with a video band that provides a further 
explanation of most of the results in Table 3.1, page 40. This video band, 
which uses the notation exp rather than e” for the exponential function, 
includes animation of tangents to curves and use of a hypothetical device 
called a ‘gradient meter’. 


Now watch Video Band C(i), ‘Visualising differentiation’. 


The notation ( ) is 


pronounced as ‘dee-by-dee-t 
of( )’. 


The expression § is 
pronounced as ‘s dot’. 


‘The expression & is 
pronounced as ‘s double dot’, 
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Summary of Section 3 


This section has introduced: 


© the formulas for the derivatives of some basic functions, as listed in 
Table 3.1 on page 40; 


© the Sum and Constant Multiple Rules for differentiating combinations 
of smooth functions; 


© the Leibniz notation and Newton's notation for derivatives. 


Exercises for Section 3 


Exercise 3.1 
Differentiate each of the following functions. 
(a) u(x) =4sinx —5cosx (b) v(x) = 7Inz — 2e* 
Vexz 
(c) w(w) = 3x? -2r +1--+ 5 
x i 
Exercise 3.2 
Find each of the following derivatives, using the notation indicated. 
CP 
(a) ie and —, where P = cosu —sinu 
du du? 
(b) & and #, where v = —e' + 17t+3 
(c) = where w = 5st — 10s? 
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4 Products, quotients and composites 


In Subsection 3.1 you saw the derivatives of various basic functions, and in These derivatives are given in 
Subsection 3.2 you met the Sum Rule and Constant Multiple Rule for Table 3.1 on page 40. 
differentiating sums or constant multiples of smooth functions. By using 

Table 3.1 and these two results you can differentiate many functions, 

including all polynomial functions. 


In this section you will see how to differentiate various types of function 
whose rules have a more complex structure. The three subsections deal in 
turn with functions that can be viewed as products, quotients or 
composites of a pair of more basic functions. At the end of Subsection 4.3, 
a table of derivatives is given which is a more general version of Table 3.1. 


4.1 Product Rule 


Here is an example of a function that cannot be differentiated using the 
Sum and Constant Multiple Rules: 


k(x) =a? sine. (4.1) 
This function, k(x) = x? x sinx, is the product of the two basic functions 
f(x) = 2 and g(x) = sin, each of which you already know how to 


differentiate. In order to differentiate k, therefore, we need a ‘product rule’ 
for differentiation. Such a rule is available, and is stated below. 


Product Rule 


If k is a function with rule of the form k(x) = f(x)g(x), where f and This rule and the others 
g are smooth functions, then & is smooth and quoted in this section are 
, ‘ ; consequences of 
Ri(a) = f(z) 9(a) + f(z)9'(2)- (42) | definition (1.3) of the 


derivative, though the proofs 


; 5 fs é are not given here. 
The right-hand side of equation (4.2) is the sum of two terms, each of 


which is the product of one of the constituent functions, f or g, with the 
derivative of the other. 
For example, if k(2) = 2? sin, as in equation (4.1), then k(x) = f(x)g(x), 
where f(x) = x? and g(x) = sin. Now f'(x) = 2x and g’(x) = cosx so, by 
the Product Rule, we have 
W(x) = f'(x)g(x) + F(x)9'(x) 
= (2x)(sin x) + (x*)(cosx) 
= 2rsinx + x? cosx. 


Here are some similar examples for you to try. 


Activity 4.1 Using the Product Rule 


By first identifying each of the following functions as a product of the form 
f(x)g(x), and then using the Product Rule, differentiate each function. 
(a) k(x) = axe* (b) A(x) = (22? -1)cosr = (c) k(x) = x(x +sinz) 


Solutions are given on page 57. 
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Both this and the earlier form 

of the Product Rule can be 

written in different ways. For 

example, the right-hand side 

here can also be written as 
dy du 


u— +v—. 
dae dx 


There is no danger of dividing 
by 0 here since any points x 
for which g(a) =0 are not in 
the domain of k. 
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With practice, it is possible to apply the Product Rule informally, without 
assigning names to the two constituent functions. by remembering in 
words that the derivative of ‘product = (first)x(second)’ is given by 


derivative derivative 
of product of first 


) x (second) + (first) x ees 


of second 


You should aim to be able to differentiate a function which is a product in 
this informal manner because it is often inconvenient to give the names f 
and g to the constituent functions, as the following activity demonstrates. 


Activity 4.2 Using the Product Rule informally 
Differentiate each of the following functions. 
(a) f(x) = (3a? + 2x + 1)e” (b) g(x) =sinaxcosx 
(c) f(t) =(#+3)Int 
Solutions are given on page 57. 


In Subsection 3.3 we described the alternative Leibniz notation for 
derivatives. In terms of this notation, the Product Rule may be expressed 
as follows. 


Product Rule (Leibniz form) 
If y = uv, where u = f(x) and v = g(x), then 


To compare this equation with equation (4.2), it is necessary to let 
y = k(2). 


4.2 Quotient Rule 


Now consider again the basic functions f(a) = x? and g(x) = sinx. The 
function 

ey Acdece 
i= giz) sing te) 
cannot be differentiated using the Sum, Constant Multiple and Product 
Rules. In this case k(2) is the quotient of f(x) and g(x); the function k 


can be differentiated using the following rule. 


Quotient Rule 

If k is a function with rule of the form A(x) = f(x)/g(x), where f and 
g are smooth functions, then k is smooth and 

ate) f(x) = flag (a) als 


B@) =" Gta)? 


The numerator of the right-hand side of equation (4.4) is the difference of 
two terms, each of which is the product of one of the constituent functions, 
f or g, with the derivative of the other. 


SECTION 4 PRODUCTS, QUOTIENTS AND COMPOSITES 


For example, if k(x) = x?/ sin, as in equation (4.3), then 
K(x) = f(x)/g(x). where f(x) = x? and g(x) =sinz. Now f'(x) = 2z and 
g'(x) = cosx so, by the Quotient Rule, we have 
(on) — Ge)E (@) = F(a) g(x) 
ae ow)? 
_ (sinx)(2x) — (a?)(cosx) 
a (sin x)? 
_ 2rsinz — 2* cosr 
a sin? x 


Here are some similar examples for you to try. 


Activity 4.3 Using the Quotient Rule 


By first identifying each of the following functions as a quotient of the form 
f(v)/g(x), and then using the Quotient Rule, differentiate each function. 


(@) Kay=5 (bay = SP) ay = 


Solutions are given on page 57. 


As with the Product Rule, you should aim to apply the Quotient Rule 
without assigning names to the constituent functions, by remembering in 
words that the derivative of ‘quotient = (top)/(bottom)’ is given by 


bottom x ( “tivative Eta derivative 
- a of top oP, of bottom 


(bottom)? 


( derivative 
of quotient 
Activity 4.4 Using the Quotient Rule informally 
Differentiate each of the following functions. 
t 
(©) pt) =>, 


sine Inu 
(a) f()= cos wr w+3 B= i 


(b) g(u) = 
Solutions are given on page 58. 


Comment 


Since tan = sin x/ cos, the result of part (a) shows that the derivative of 
f(z) =tanz is f'(x) = sec? z. 


In Leibniz notation, the Quotient Rule may be expressed as follows. 


Quotient Rule (Leibniz form) 


If y = u/v, where u= f(x) and v = g(x) 4 0, then As with the Product Rule, 
the right-hand side here can 
dy a oo (-E Lg ut) be written in other ways, for 
de uw? \ da. dx example, as 
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The composite function 

k(x) = g(f(2)) is defined only 
if any possible output from f 
is an allowable input for g. In 
the language of Chapter A3, 
Section 1, the image set of f 
must be contained within the 
domain of g. 


Note that cos? « means 
(cosx)?. 


CHAPTER C1 DIFFERENTIATION AND MODELLING 


4.3 Composite Rule 


In the previous two subsections, we used the basic functions f(x) = x? and 
g(x) = sin to construct the product f(x)g(x) = x? sinx and the quotient 
f(x)/g(x) = z?/sina. There is yet another way in which these two basic 
functions can be combined to form a different type of function. 


If we first apply the function f to x, to obtain f(x), and then apply the 
function g to f(x), we obtain the function 


k(x) = 9(f()) = sin(2*). (4.5) 
A function & with rule of the form k(x) = g(f(x)) is called a composite 


function, or informally a ‘function of a function’. The expression g( f(x)) 
can be thought of as 


g(u), where u= f(x). 


The intermediate variable u labels the outputs from the ‘inner’ function f, 
which are also the inputs to the ‘outer’ function g. The rule for g in the 
case of equation (4.5) can be expressed as g(u) = sin u. 


Some other examples of composite functions are: 
k(x) =e?*, where u = f(x) = 2x and g(u) =e": 
k(x) =cos*x, where u = f(x) = cosa and g(u) = w. 


Activity 4.5 Recognising composite functions 


Identify each of the following as a composite function of the form 
k(x) = 9(f(x)). by specifying the rules for the ‘inner’ function u = f(x) 
and for the ‘outer’ function g(u). 


(a) k(x) = In(a? +1) (b) k(x) = cos(a*) (ec) k(x) = cos’ x 
(d) k(x) = = 
Solutions are given on page 58. 


To differentiate composite functions, we use the following rule. 


Composite Rule 


If k is a function with rule of the form k(x) = g(f(«)), where f and g 
are smooth functions. then k is smooth and 


K(x) = 9(f(a))f'(z)- (4.6) 


This rule needs to be applied carefully. The right-hand side of 

equation (4.6) is a product of two derivatives, but the first of these involves 
the function g’ evaluated at f(x). When first learning how to use the 
Composite Rule, it is helpful to replace f(a) by the variable u, as in the 
text about composite functions above. 


For example, if k(x) = sin(a*), as in equation (4.5), then k(x) = g(f(z)), 
where u = f(x) = 2? and g(u) =sinu. 


SECTION 4 PRODUCTS, QUOTIENTS AND COMPOSITES 


Now f'(x) = 2a and g’(u) = cosu so, by the Composite Rule, we have 
K(x) = 9 (F(z) F(x) 
=4g'(u)f'(z) (where u= f(a) = 2*) 
= (cos u)(2r) 
=cos(z?) x 22 (since u = x") 
= 2x cos(x”). 


Here are some similar examples for you to try. 


Activity 4.6 Using the Composite Rule 


By first identifying each of the following functions as a composite of the 
form k(x) = g(f(«)), and then using the Composite Rule, differentiate 
each function. 

(a) k(x) =e" — (b) k(x) =sin(2r) —(c)_ A(x) = cos(e*) 


Solutions are given on page 58. 


Once again, you should aim to apply the Composite Rule without 
assigning names to the constituent functions, by remembering in words 
that the derivative of ‘composite = outer(inner)’ is given by 


derivative \\ _ (derivative derivative If k(x) = g(f(z)), then f is 
of composite) ~ \ of outer ) * \ of inner )* the inner function and g the 
outer function. 
However, you must also remember that the derivative of the outer function 
g is evaluated at f(2), which means that the ‘words’ version above is not 
self-contained in the manner of the corresponding statements for the 


Product and Quotient Rules, Although you should aim eventually to be You should explicitly assign 
able to differentiate a function which is a composite in this informal names to the constituent 
manner, mastery may take longer than with the earlier rules. functions for as long as you 


find this helpful. 


Activity 4.7 Using the Composite Rule informally 


Differentiate each of the following functions. 
(a) f(x) = cos(4ar) (b) g(a) = In(a? + x + 1) (c) v(t) = sin(cost) 
Solutions are given on page 58, 


The Leibniz form of the Composite Rule is particularly memorable. When 
stated in this form, the result is usually referred to as the Chain Rule. 


Chain Rule (Leibniz form of Composite Rule) 
If y = g(u), where u = f(x), then 
dy _ dy du 


dx dudx’ 


This version has the merit (for memorising its form) but also the danger of 
looking like an obvious statement about fractions (which it is not!). 


49 


You have already 
differentiated specific 
functions of this type: 

sin(2x) in Activity 4.6(b), and 
cos(4x) in Activity 4.7(a). 


This table, in which a is a 
non-zero constant, appears in 
the Handbook, Note that the 
results of Activi 4.6(b) 
and 4.7(a) are consistent with 
entries in this table. 
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For example, if y = sin(x?), then y = sinu, where u = 2”, so 


dy _ du _ 

Priahs ee 2x and 
en ee 
ae Sade (cos u)(2x) = 2x cos(x*). 


Finally, note that the Composite Rule enables us to generalise somewhat 
the results of Table 3.1, to cope with functions such as sin(ax), cos(az), 
e*” and In(ax), where a is any non-zero constant. For suppose that the 
derivative of the function g is known, and that the function k is defined by 
k(x) = g(ax). This has the form of a composite function k(x) = g(f(x)), 
for which the inner function is f(a) = ax, with derivative f'(x) =a. It 
follows from the Composite Rule that 


K(x) = g'(F(x))f'(2) = ag'(ar). 


By applying this result to the functions in the last four lines of Table 3.1, 
we obtain the amended table below. 


Table 4.1 
Function f(a) Derivative f’(x) 
c 0 
a” nen} 
sin(ar) acos(axr) 
cos(ax) —asin(axr) 
et ae™ 


In(ax) (aa > 0) 1/x (ax >0) 


Summary of Section 4 


This section has introduced: 


© the Product Rule, Quotient Rule and Composite Rule (Chain Rule) 
for differentiating functions which are appropriate combinations of 
other more basic functions; 


© the formulas for the derivatives of some basic functions of the 
form f(x) = g(ax), where a is a constant. as listed in Table 4.1 above. 


Exercises for Section 4 


Exercise 4.1 
Use the Product Rule to differentiate each of the following functions. 


(a) f(x) = (2? + 2)sine (b) g(a) =e” cos (c) u(t) = Vtlnt 


Exercise 4.2 


Use the Quotient Rule to differentiate each of the following functions. 


4 * ae 
@) o@)== wm ey=BE" | sw=s 


Exercise 4.3 
Use the Composite Rule to differentiate each of the following functions. 
(a) k(r)=sin(Inz) (b) k(x) =sintx —(c) k(x) = e* 

1 


(a) p(t)=In(e" +t) — (©) as) = cos (3) inrl= a5 


5 Optimisation with the computer 


To study this section you will need access to your computer, together with 
Computer Book C. 


In this section you will see that the computer can be used to find the 
derivative of any smooth function whose rule is entered. This means that, 
even where the function concerned is too complicated to permit the general 
differentiation results in this chapter to be applied with ease ‘by hand’, an 
answer can still be obtained from the computer. Equally, the computer can 
be used to provide an independent check on any calculation done by hand. 


Note, however, that this does not absolve you from the need to be able to 
differentiate by hand! Just as with calculators, there are certain 
calculations in differentiation which are sufficiently simple (with practice) 
to make it worthwhile to do them by hand rather than by switching on the 
computer. 


In more complicated cases the computer may provide the only viable 
option, and can then be extremely useful. As you will see, it can be used 
also to find precisely and to clas stationary points and to apply the 
Optimisation Procedure described in Subsection 2.3. Since the graph of 
any function can be plotted rapidly on the computer, it provides an 
efficient alternative to the graph-sketching techniques introduced in 
Subsection 2.2. 


Refer to Computer Book C for the work in this section. 


Summary of Section 5 


In this section you saw how the computer can be used to find derivatives 
symbolically, and you applied this facility in the context of examples which 
called for optimisation. 


Cs) 


You are expected to be able 
to differentiate ‘by hand’ any 
function appearing in 

Table 4.1 on page 50, together 
with any constant multiples, 
sums, products, quotients or 
composites of these functions. 


You saw a rough form of 
optimisation, based on a 
plotted graph, carried out in 
Chapter A3, Section 5. 


51 


Summary of Chapter C1 
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In this chapter you met differentiation, which is the process that allows 
calculation of the rate at which one variable changes with respect to 
another variable. This can be visualised as the gradient of a corresponding 
graph. The process can be used to find the optimum value of a function 
over an interval, or to assist in sketching the graph of a function. 


Table 4.1 on page 50 lists the derivatives of certain basic functions. Other 
functions may be built up from these by taking constant multiples, sums, 
products, quotients or composites of them. The chapter introduced rules of 
differentiation for all of these cases. 


Learning outcomes 


You have been working towards the following learning outcomes. 


Terms to know and use 


Calculus, gradient, tangent, smooth, rate of change (average, 
instantaneous), derived function, derivative, differentiation, power 
function, quartic function, local maximum, local minimum, stationary 
point, increasing/decreasing on an interval, second derived function, 
second derivative, optimum value, optimisation, constant function, 
composite function. 


Symbols and notation to know and use 
The various notations for first and second derivatives: 
© f(x), f"(x) (function notation); 
dy dy d 
dx’ dx?’ dir 
© 4,8 (Newton's notation). 
The notation for limits, as applied in the definition of the derivative: 


(ferme sey 


(y). <(sle)) (Leibniz notation); 


° f(z)= jim (function notation); 


di 6 F * 
ne a Elite z (Leibniz notation). 


br—0 OF 

Mathematical skills 

© Differentiate ‘by hand’ any of the basic functions whose derivatives are 
given in Table 4.1 on page 50. 

© Use the appropriate rules to differentiate any constant multiple of a 
basic function, or any sum, product, quotient or composite of a pair of 
basic functions. 

© Use differentiation to find instantaneous rates of change, including 
velocity (given a position function for an object), and evaluate the rate 
of change at any value of the independent variable. 


© Use differentiation to find the gradient of a graph, and evaluate the 
gradient at any point on the graph. 


SUMMARY OF CHAPTER Ci 


© Apply the limit definition of the derived function/derivative in very 
simple cases. 

© Differentiate twice to find second derivatives. 

© Apply tests to determine whether a stationary point is a local 
maximum or a local minimum (or neither). 

© Draw a rough sketch of the graph of a simple function, based on the 
location and classification of stationary points and the position of the 
y-intercept. 

© Find the maximum or minimum value of a function on an interval. 

Mathcad skills 

© Apply Mathcad to differentiate an expression with respect to a’ 
specified variable, by using Mathcad’s d/dx operator and symbolic 
evaluation. 

© Use the copy and paste facilities in Mathcad. 


Ideas to be aware of 


° 
° 


The difference between average and instantaneous rates of change. 
That the graph of a smooth function has a gradient at each point on 
it, as given by the gradient of the tangent at the point. 

That the derived function (derivative) of any smooth function is found 
by differentiation, and is equivalent both to the gradient of the graph 
of the function and to the instantaneous rate of change of the 
dependent variable with respect to the independent variable. 

Some possible uses for differentiation. 


The concept of an optimum value, and its relationship with the 
derivative of a function. 
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Solution 1.1 


In each case we apply result (1.1), which states that 

if f(x) = ax? + bx +c, then f'(x) = 2ax +b. 

(a) For f(a) = 2? +a +7, we have a= 1,b=1 and 

c=7, 80 f'(x) =2r+1. 

For f(a) = 5a? — a, we have a = 5, b= —1 and 

c=0, 80 f"(«) = 10x —1. 

(c) For f(x) =5 +32 — 
and ¢=5, so f'(2) = —x +3. 


(o 


Solution 1.2 


(a) Result (1.1) gives f’(a) = 2x, so the gradient at 
the point (4, +), where a = 3, is 
f(}) =2xd=1. 
(b) Result (1.1) gives f’(a) = —2x, so the gradient 
at the point (3,—8) is f"(3) = —2 x 3=—6. 
(c) Result (1.1) gives f’(x) = 6x + 32, so the 
gradient at the point (1,—207) is 
f'() =6 x 1 +32 =38. 
Solution 1.3 
(a) From Example 1.1, we have 
(a +h)? = 25 + 307h + 3ah? + hi, 
so we deduce that 
(a +h)! =(«+h)(x+h)® 
= (w+ h)(a* + 327h + 30h? +h) 
=a + 4a%h + 6x7h? + dh + hi, 


(b) By equation (1.3), we need to consider the 
quotient 
f(a +h) ~ f(x) _ (w@+hy* -x! 


h h “ 
where fh is a non-zero number. Using the 
expansion for (x +h)* from part (a), we have 


f(a +h) = f(x) _ 42h + Gah? + 4h + ht 


h h 

= 4x + 62h + dich? +h. 
For small values of h, the expression 
4x* + 627h + 4h? + h3 is close to 4° because 
each of 6::h, dich? and h® is small. Thus in the 
limit as h — 0, we obtain f(x) = 4°. 

(c) The point (3, 7%) on the graph corresponds to 
x = $, for which the gradient is 
14) = 4G)" =4G@) = $- 


Solution 1.4 


In each case, we apply result (1.4), which states that 
if f(x) =2", then f(x) =nz"—!. 
(a) Here f(x) =2°, son =6. The derivative is 
S'(@) = 62" = 62°. 
(b) Here f(x) = 1/23 = 2-4, son = —3. The 
derivative is 
f(a) = -32-8* = 32-4, 
which is often written as f!(x) = —3/24. 
(c) Here f(x) = Vz =<'/?, son =}. The 
derivative is 
f(a) = yaV27) = Ly V2, 


which is often written as f"(x) = 1/(2V2), 


Solution 1.5 

(a) The position function is s = f(t) = 5t?. This is a 
quadratic function of t, so we apply result (1.1) 
to obtain the velocity function 

v= f’(t) = 108. 

(b) When t = 3, we have v = f'(3) = 10 x 3 = 30, so 
the velocity is then 30 ms~!. 

(c) When v = 15, we have 15 = 10t, which has 
solution ¢ = 1.5. Hence the stone has velocity 
15 ms~! after 1.5 seconds. 


Solution 2.1 
(a) From result (1.4), the derivative of 2° is 8x7 and 
the derivative of ° is 5a. From result (2.3), the 
derivative of the polynomial function 
f(x) = 6r* — 32° is 
f'(x) = 6(derivative of 2°) — 3(derivative of *) 
= 6(827) — 3(52*) 
= 4827 - 152°. 
(b) From results (1.4) and (2.3), the derivative of 
the polynomial function 
F(x) = —2? + 52? — 72 + 15 is 
J'(x) = —(82?) + 5(2x) — 7(1) 
= —3a? + 10x —7. 
The point (1.12) on the graph of y = f(x) 
corresponds to x = 1, for which the gradient of 
the graph is f’(1) = —3(1)? + 10(1) —7=0. 
(This outcome indicates that the graph of the 


given function f(x) has a horizontal tangent at 
the point (1, 12).) 


Solution 2.2 


(a) From results (1.4) and (2.3), the derivative of 
the polynomial function 
f(z) = 23 — 6x? — 152 + 54 is 
f' (x) = (3x?) — 6(2x) — 15(1) 
= 3r? — 122-15 
= 3(x? — 42 —5). 
(b) The equation f’(2) = 
x? —da —5 =0; that is, (x +1)(4—5) = 


0 is equivalent to 


This equation has solutions « = —1 and x = 5. 
Thus the stationary points of f are at 2 = —1 
and x = 5. 

Solution 2.3 


(a) For « < —1, we have 
w+1<0 and r—-5<-6<0, 


so f'(x) = 3(a + 1)(a—5) > 0 (since f(x) is the 
product of 3 and of two negative numbers). 


For —1 <a <5, we have 
w+1>0 and r-5<0, 


so f'(x) < 0 (since f(x) is the product of 3, one 
other positive number and one negative 
number). 


For x > 5, we have 
a+1>6>0 and r-5>0, 


so f'(x) = 3(a + 1)(x — 5) > 0 (since f(x) is the 
product of 3 and of two other positive numbers). 


(b 


By the Increasing/Decreasing Criterion, f is 
increasing on (—9°0, —1) and on (5,00), but 
decreasing on (—1,5). 


Solution 2.4 


For the stationary point at « = —2, we have to 
choose test points x, to the left 

of —2 and xp to the right of —2 such that there are 
no stationary points between x, and —2 nor between 
—2 and xg. Bearing in mind Remark 2, page 27, and 
the fact that the only other stationary point is at 1, 
we can take (say) 2, = —3 and rp = 0. (Another 
choice is x, = —3 and rz = —1, but choosing rz = 0 
simplifies the calculation of f’(aR).) 


For the stationary point at + = 1, we have to choose 
test points 2, to the left of 1 and zp to the right of 1 
such that there are no stationary points between x. 
and 1 nor between 1 and ay. Bearing in mind 
Remark 2, page 27, and the fact that the only other 
stationary point is at —2, we can take (say) 2, =0 
and rp = 2. 


SOLUTIONS TO ACTIVITIES: 


Solution 2.5 
Step 1: The derivative of the function 
f(x) = 3a4 — 2x3 — 9x? +7 is 
f'(x) = 122° — 6x? — 182 
= 6x(2x? — x — 3) 
= 62(2r — 3)(x +1). 


Solving the equation f'(x) = 0, we find that the 


stationary points of f are at «= —1, rx =0 
and x = 3. 
Step 2: To classify the stationary point at « = —1, 
we choose test points ar, = —2 and rp = —4, say. 
Then we have . 

J'ai) = f'(—2) = 6(—2)(—7)(-1) = -84 <0, 


J'(er) = f'(—3) = 6(—3)(-4) (5) = 6 > 0, 
so f has a local minimum at « = —1, by the First 
Derivative Test. 


To classify the stationary point at 2 = 0, we choose 
test points zp = -3 and zy = 1, say. Then we have 


S'(x1) = f'(-}) = 6 > 0, 

S'(ar) = £'(1) = 6(1)(-1)2) = 
so f has a local maximum at a = 0, by the First 
Derivative Test. (Note that the value of f’(—4) was 


found when classifying the stationary point at 
r=-l.) 


-12 <0, 


To classify the stationary point at x ; we choose 
test points zy, = 1 and xp = 2, say, Then we have 


f(er) = f') = -12 <0, 
fee) = f'(2) = 6(2)(1)(3) = 


so f has a local minimum at x = 3, by the First 
Derivative Test. 


36> 0, 


Solution 2.6 


The second derivative of the function f is the (first) 
derivative of the function f’(2) = —3x? + 10a —7. 
From results (1.4) and (2.3), this is 


f(x) = -3(2x) + 10(1) 
= —6r + 10. 


Solution 2.7 


The second derivative of f is obtained by 
differentiating f(x) = 122% — 6x? — 182, to obtain 


f"(x) = 36x? — 12r — 18 = 6(62? — 2x — 3). 
For the stationary point at 2 = —1, we have 
f"(-1) = 6(6(-1)? — 2(-1) — 3) = 30> 0, 


so f has a local minimum at x = —1, by the Second 
Derivative Test. 
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For the stationary point at « =0, we have 
£''(0) = 6( 


so f has a local maximum at « = 0. by the Second 
Derivative Test. 


—3) =—18 <0, 


For the stationary age at x= 3, we have 
f"(3) = 6(6(3)? — 2(3) — 3) 
= 6(2 -3-3) =45>0, 


so f has a local minimum at x = 3. by the Second 
Derivative Test. 


(These classification results agree with those that you 
found using the First Derivative Test in Activity 2.5.) 


Solution 2.8 


The function is f(a) = 3a — 23 — 9x? +7, and the 
y-coordinates of stationary points on the graph of 
y = f(z) are 


f(-1) =3+2-94+7 f(0)=7 and 
s(Q)= 48-9 -947- 


Hence the graph passes through the points (—1,3), a 
local minimum, (0,7), a local maximum (on the 
y-axis), and (3,—47#), a local minimum. A sketch of 
the graph is as follows. 


ut y= f(z) 


ay 


3 ~B) 


Figure S.1 
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Solution 2.9 


In each case, we follow the steps of the Optimisation 
Procedure. 


(a) Step 1: The derivative of the function 
f(x) = 423 — 2? —8r+1is 
f'(x) =a? — 24-8 = (x —4)(x +2), 
so the stationary points are at x = 4 and 
r=-2. 
Step 2: At the endpoints of the interval 
I = {-3,5], the function values are 
f(-3)=7 and f(5)= = -221, 
while at the stationary points aia are both 
inside J) the function values are 
f(-2)= 3 =105 and f(4)= 
Step 3: The age value of f(x) on [—3.5] is 
105 (at = —2) and the least value of f(x) on 
[-3.5] is -253 (at x = 4). 
(In this case, both of the optimum values occur 
at stationary points within the interval.) 


= —952 
= -253. 


(b) Step 1; The derivative of the function 
f(x) = 32? - 27 +5 is 
f' (x) = 6x —2 = 2(3r —1). 
so the only stationary point is at x = 3. 
Step 2: At the endpoints of the interval 
I ={-1,4), the function values are 
f(-1)=10 and f(4) = 45, 
while for the stationary point (which is inside /) 
the function value is 
S(5) = B= 43. 
Step 3: The greatest value of f(x) on (—1. 4] is 
45 (at 2 = 4) and the least value of f(x) on 
[-1.4] is 43 (at z= }). 
Solution 3.1 


Evaluating the ratio (sinh)/h for each of 

h = 0.1,0.01,0.001 in turn gives the respective 
answers (to 6 d.p.) 0.998334, 0.999983 and 
1.000000. The limiting value as h — 0 appears to 
be 1. 


Solution 3.2 


Evaluating the ratio (e —1)/h for each of 

h =0.1,0.001, 0.00001 in turn gives the respective 
answers (to 6 d.p.) 1.051709, 1.000500 and 
1.000005. The limiting value as h — 0 appears to 
be 1. 


Solution 3.3 
(a) The derivative of f(x) = a4 —2sinz is 
f(x) = 42° — 2cosz. 
(b) The derivative of g(z) = 3e* — 4 Inc is 
4 
"(n) = 967 — 4 
g(x) = 3e i 


(c) Since 1/a? = 2~? has derivative 
—2x~% = —2/x%, the derivative of 
p(x) = 6/x? + 3coszx is 


p(x) = 4 —3sinz. 


Solution 3.4 
(a) The first and second derivatives of V = 4xr* 
with respect to r are 
hiebier ev _ 
ta Arr? and PP = 8ar. 
The first and second derivatives of f(#) = 45lnz 
are 


y= 2 ands") =-%, 


(b 


so we have 
f'(5)=9 and f"(3)=—5. 
(c) The derivative of s = 10 — 5t? with respect to t is 


Sa eat so fe 


dt rl oes 


‘The first and second derivatives of u = 3cost 
with respect to t are 


(d 


u=-—3sint and ii =—3cost, 


Solution 4.1 


(a) In this case k(x) = f(x)g(x), where f(z) = 2 
and g(a) =e". Since f'(x) = 1 and g’(x) =e", 


we have 
K(x) = f'(a)g(a) + f(a)9' (x) 
= (1)(e*) + (a)(e*) 
=e*(1+2). 


(b) In this case k(ar) = f(x)g(x), where 
F(a) = 2a? — 1 and g(x) = cos. Since 
f'(x) = 4x and g/(x) = —sinx, we have 
K(x) = f'(x)g(x) + f(2)9'(x) 
= (4r)(cosx) + (2x7 — 1)(—sin x) 


= 4x cosa — (2x7 — 1)sinx. 
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(c) In this case k(x) = f(x)g(x), where 
f(x) = Ve =r"? and g(x) = x+sinx. Since 
Sf'(x) = 42-1? = 1/(2)/z) and 
g(x) = 1+ cosr, we have 
K(x) = f'(a)g(a) + f(@)9'(x) 
= (xz) (x +sinx) + (¥x)(1 + cosx) 
_ rt+sing 
= ae 
This may also be written as 
a+sing +2/r/zx(1 + cosx) 
2Vr 
_ 3x +sinx +2rcosx 


= WE 


+ Va(1+cosx). 


k(x) = 


Solution 4.2 
(a) The derivative of f(a) = (8x? + 2a + L)e* is 
f(x) = (6x + 2)(e") + (3x? + 2x + 1)(€7) 
= (3x? + 8x + 3)e". 
(b) The derivative of g(x) = sinx cos is 


g'(x) = (cos.x)(cos x) + (sinz)(—sin x) 


= cos" x — sin? x, 


(c) The derivative of f(t) = (#2 +3) Int is 
f(t) = (2t)(Int) + (# +3) (7) 
= 2tnt+ as 
Solution 4.3 


(a) In this case k(x) = f()/g(x), where f(x) = e* 
and g(x) =x. Since {"(a) = e* and g/(x) = 1, 


we have 
Ny — $2) F(z) = F(x)g' (2) 
Re) = G@? 
— (x)(e*) ~ (e*)Q) 
(x)? 
ys e* (a —1) 


=] 
(b) In this case k(x) = f(x)/g(x), where 
F(a) = cosx and g(x) = 2? — 1. Since 
f'(x) = —sinx and g'(x) = 2x, we have 
(g(x)? 
(a? =1)(—sinx) — (cosx)(2x) 
= (x? - 1) 
(x? — 1)sinz + 2xcosxr 
(8-1) 


K(x) = 
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(c) In this case k(x) = f(x)/g(x). where 
f(z) ="+1 and g(x) = 2-1. Since 
f'(z) = g'(x) = 1, we have 


_ oa)" (x) = F(2)9"(x) 


- 
Be) oe)? 
_ @=1)0) = (e+ 90) 
(@—1? 
ES. 2 
~ @=1F 
Solution 4.4 


(a) The derivative of f(r) = sin/ cosa is 
1.) — (cosir)(cos xr) — (sinx)(—sinx) 
f(z) = aa 
_ cos? ar + sin? x 
ae cos? x 


u 
cos? 


Since 1/ cos = seca, this shows that 

fila) = sec? x. 
(b) The derivative of g(u) = Inu/(u? + 3) is 
_ (u? +8)(1/u) — (In-u)(2u) 
2 (u? +3)? 
u? +3 —2u*Inu 

u(u? + 3)? 

(c) The derivative of p(t) = t/(e' — 1) is 
(et =) = (He) 


g'(u) 


p(t) = 


(e = 1)? 
_ (=tet-1 
(e-1) 
Solution 4.5 


(a) The function k(a) = In(a? +1) is of the form 
k(x) = g(f(x)) with 
u=f(x)=a27+1 and g(u)=Inu. 
(b) The function k(«) = cos(a*) is of the form 
A(x) = g(f(w)) with 
u=f(x)=2> and g(u) =cosu. 
(c) The function k(x) = cos* x = (cos)? is of the 
form k(a) = g(f(x)) with 
u=f(«)=cosx and g(u) =u. 
(Note that this case involves the same 


constituent functions as in part (b), but applied 
in the reverse order.) 


(d) The function k(2) = e*"* is of the form 
h(a) = g(f(a2)) with 


u=f(a)=sine and g(u) =e". 


(because cos? x + sin? x = 1). 


Solution 4.6 

(a) In this case k(x) = e*” = g(f(x)). where 
u= f(z) = 2° and g(u) =e". Now f'(x) = 3x? 
and g'(u) = e", so we have 

K(x) = g'(f(a))f'(@) 
=gl(u)f'(x) (where u= f(x) = 2°) 
= (e")(327) 
=3r%e* (since u = 2°). 

(b) In this case k(2:) = sin(2x) = g(f(x)), where 
u= f(x) = 2x and g(u) =sinu. Now f’(x) = 2 
and g'(u) = cosu, so we have 

K(x) = g'(f(a))f'(@) 

g/(u)f'(x) (where u = f(x) = 2x) 

(cos u)(2) 

=2cos(2r) (since u = 2r). 


(c)_ In this case k(x) = cos(e") = g(f(x)), where 
u= f(x) =e and g(u) =cosu, Now f'(«) = e* 
and g/(u) = —sinu, so we have 

W(x) =9'(f(@))f'(@) 
=q'(u)f'(z) (where u= f(x) =e") 
= (—sinu)(e”) 


=-e"sin(e”) (since u = e*). 


Solution 4.7 


In each case, a concise solution (obtained by 
‘informal’ use of the Composite Rule) is followed by 
an expanded version. Note in the expanded versions 
for parts (a) and (b) that f and g cannot be used to 
denote the ‘inner’ and ‘outer’ functions, since one of 
them appears as the name of the given composite 
function. Hence we label the composite as q(p(x)) 
instead. 


(a) The derivative of f(«) = cos(4x) is 
f'(x) = (—sin(4x))(4) 
= —4sin(4z). 
Expanded version: In this case 
F(x) = cos(4x) = g(p(x)). where u = p(x) = 4x 
and q(u) = cosu. Now p'(x) = 4 and 
q'(u) = —sinu, so we have 
f(x) = ¢(v(2))p'() 
=q(u)p'(x) (where u = p(x) = 4x) 
= (—sinu)(4) 
=-—4sin(4x) (since u = 42). 


(b) The derivative of g(x) = In(x? + x +1) is 


o@)= (seq) e+) 
_ _2e+1 
- See 
Expanded version: In this case 
g(x) = n(x? + x + 1) = q(p(x)), where 
u = p(x) =2? +2+1 and q(u) =Inu. Now 
p'(a) = 2a +1 and q’(u) = 1/u, so we have 
g(x) = 4'(P(a))p"(@) 
=a'(u)p'(x) 
(where u = p(x) = 27 +241) 
= (:) (2x +1) 
==—— _ (sinceeu=2? +241). 
(c) The derivative of v(t) = sin(cost) is 
v'(t) = (cos(cos t))(— sin t) 
= —sintcos(cost). 
Expanded version: In this case 
u(t) = sin(cost) = q(p(t)), where u = p(t) = cost 
and q(u) = sinu. Now p'(t) = —sint and 
q'(u) = cos u, so we have 
v(t) =a'(w(t))p'(t) 
=q(u)p'(t) (where u = p(t) = cost) 
= (cos u)(—sint) 
=-sintcos(cost) (since u = cost). 


Solutions to Exercises 


Solution 1.1 


Result (1.1) states that if f(a) = ax? + br +c, then 
f'(v) = 2ax +b. For y = f(x) = 6 + 5x — 42. we 
have a = —4, b = 5 and c = 6, so that 


f'(w) = —8a + 5. 


Hence the gradient at the point (2.0), where z= 2, 
is f'(2) =-8x 24+5=-IL 


Solution 1.2 


(a) Putting the left-hand side fractions over the 
common denominator (x + h)x, we have 


I ee » ath 
zth 2 (@+h)e (e+h)r 
_ @—(e+h) 
~ (@+h)r 

r h 
~ (a+ ha’ 


as required. 


(b) By equation (1.3), and using the result of 
part (a), we need to consider the quotient 


ed aa ‘) 


h “h\ath & 
1 h 
~h (atm) 
a 1 
~ (ep ha" 


For small values of h, the quotient —1/((« + h)z) 
is close to —1/2?. We conclude that 


1 


I(@) = 


(c) Result (1.4) states that if f(a) =a", then 


I'(v) =na"—|. Here f(x) = 1/r 1 so that 
n= -—1. The derivative is therefore 

f'(x) =—1271-! = -27?, 
which is often written as f’(a) = —1/2x?, in 


agreement with the result of part (b). 


Solution 1.3 


(a) The position function is s = f(t) = 3? +t. This 
is a quadratic function of t, so we apply 
result (1.1) to obtain the velocity function 


v=/f'(t)=6t+1. 
(b) When ¢ = 10, we have 


v= f’(10) = 6 x 10+ 1 =61, so the velocity is 
then 61 ms~!. 


(c) When v = 25, we have 25 = 6t + 1, which has 
solution t = 4. Hence the aeroplane has velocity 
25 ms~! after 4 seconds. 
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Solution 2.1 


(a) From result (1.4), the derivative of x!" is 
10121, the derivative of x°° is 50x*° and the 
derivative of x (=r!) is 1 (= 2°). From 
result (2.3), the derivative of the polynomial 
function f(a) = 7r!! — 132°° + x is 


f'(x) = 71012!) — 13(502*°) +1 
= 707x' — 6502"? +1. 


(b) From results (1.4) and (2.3), the derivative of 
the polynomial function 
f(x) = 52° + 42? — 9x —8 is 
J'(x) = 5(82?) + 4(2) — 9(1) 
= 15x? + 84 —9. 
The point (—1,0) on the graph of y = f(x) 
corresponds to « = —1, for which the gradient is 
f'(-1) = 15(-1)? + 8(-1) - 9 = -2. 
Solution 2.2 


(a) The derivative of the function 
f(z) = —a — 32? + 92 +5 is 
Sf'(x) = -3x? - 6x + 9 = —3(a +3)(x — 1). 
Solving the equation f’(x) = 0, we find that the 
stationary points of f are at c= —3 and x = 1. 
(b) (i) Step 1 of the strategy has been carried out 
in part (a). Step 2 is as follows. 
To classify the stationary point at « = —3, we 
choose test points xy, = —4 and rp = 0, say. 
Then we have 
J'ai) = f'(—4) = -3(-1)(-5) = -15 < 0, 
f'(xr) = f'(0) =9>0, 
so f has a local minimum at x = —3, by the 
First Derivative Test. 


To classify the stationary point at 2 = 1, we 
choose test points xy = 0 and wg = 2, say. Then 
we have 


f'(ar) = f'(0) =9 > 0, 
J'(wp) = f'(2) = -3(5)(1) = -15 <0, 


so f has a local maximum at x = 1, by the First 
Derivative Test. 


(ii) The second derivative of f is 
f(z) = -62 — 6 = —6(z + 1). 

For the stationary point at « = —3, we have 
f"(-3) = -6(—2) = 12 >0, 


so f has a local minimum at x = —3, by the 
Second Derivative Test. 


For the stationary point at x = 1, we have 
f"(1) = -6(2) = -12 < 0, 
so f has a local maximum at x = 1, by the 
Second Derivative Test. 
(c) The y-coordinates of stationary points on the 
graph of y= f(x) are 
f(-3) = 27 - 27-27 +5 = -22, 
f(1) =-1-3+9+5=10. 
Also f(0) =5. Hence the graph passes through 
the points (—3, —22), a local minimum, and 
(1, 10), a local maximum. It also passes through 


(0,5): that is, it cuts the y-axis at y= 5. A 
sketch of the graph is as follows. 


Ua 


av 


(-3,-22) 


Figure S.2 
Solution 2.3 


(a) The second derivative of f is obtained by 
differentiating f'(a) = 3x? — 12x — 15, to obtain 


f(a) = 6x — 12 = 6(a — 2). 
For the stationary point at 7 = —1, we have 
f"(-1) = 6(-3) = -18 <0, 


so f has a local maximum at x = —1, by the 
Second Derivative Test. 


For the stationary point at « = 5, we have 
£"(5) = 6(3) =18 > 0, 


so f has a local minimum at x = 5, by the 
Second Derivative Test. 
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(b) In each case, we follow the steps of the 


Optimisation Procedure, but Step 1 has already 
been completed. The stationary points of the 
function f(x) = x? — 6x? — 15x + 54 are at 

a =—landz=5. 

(i) Step 2: At the endpoints of the interval 

I = (0,7), the function values are 


f(0)=54 and f(7)=-2, 


while for the stationary point at « = 5 (which is 
inside J) the function value is 


£(5) = —46. 
(Note that, on this occasion, the other 
stationary point (at « = —1) is not inside J, and 


hence does not feature at this stage.) 


Step 3: The greatest value of f(r) on [0,7] is 
54 (at = 0) and the least value of f(x) on (0,7| 
is —46 (at x = 5). 

(ii) Step 2: At the endpoints of the interval 

J =(-4,7]. the function values are 


f(-4)=-46 and (7) =—-2, 


while for the stationary points (both of which 
are inside J) the function values are 


F(-1) =62 and f(5)=—46. 


Step 3: The greatest value of f(a) on [—4,7] is 
62 (at x = —1) and the least value of f(x) on 
{—4. 7] is —46 (at both x = —4 and x = 5). 


Solution 3.1 


(a) The derivative of u(x) = 4sinx —5cos is 


u(x) =4cosxr + 5sinz. 


(b) The derivative of u(r) = 7Ina — 2e” is 


u(x) = <6", 
x 


(c) Since 1/x = «~! has derivative —x~? = -1/2?, 


3 


and 1/2? = x~* has derivative —2x -2/x3, 
the derivative of 
w(x) = 3x? — 22 + 1-1/2 + 2/2? is 
1 4 
w' (x) = 6x — 2+ zy 
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Solution 3.2 


(a) The first and second derivatives of 
P =cosu—sinu with respect to u are 


Fe 7 Sinu—cosu, 
(b) The first and second derivatives of 
v=—e' +17t+3 with respect to t are 


= -e' +17, * 


b=-et. 
(c) The derivative of w = 5s — 10s? with respect 
to s is 
aw 208 20a, 50 22) eda, 
ds ds |,-5 


Solution 4.1 
(a) The derivative of f(a) = (a* +2) sin is 
f'(x) = (3x? + 1)(sinx) + (a3 + x)(cosx) 
= (3x? + 1) sina + (2 + x) cos x. 
(b) The derivative of g(x) =e” cos is 
g(x) = (e*)(cos x) + (e*)(—sin x) 
= e"(cos x — sin x). 


(c)_ The derivative of Vf = t'/? is 3171/2 = 1/(2v*). 


Hence the derivative of u(t) = Vélnt is 


u'(t) = (2) (Int) + (ve) (7) 


Int+2 
Qt 


Solution 4.2 
(a) The derivative of g(x) = «*/Inx is 
rye) — (lne)(4ar*) — (at) (1/2) 
g(x) = Sane 
_ #3(4Ina —1) 
~ (nz)? * 
(b) The derivative of k(t) = (sint — t)/e* is 
(et) (cost — 2t) — (sint — t?)(et) 
(et)? 
_ cost —sint +t? — 2t 
ae 


R(e)= 


ag 2 a, Beatle 
(c) The derivative of f(«) = ae 


iq) — (6082)(0) ~ (1)(=sinz) 
fe) = ee 

sina 

cos? x 
(Since 1/cosx = secx and sinx/cosz = tanx, 
this shows that the derivative of f(a) = secx is 
f'(x) =secz tan.) 
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=—cosu+sinu. 


Solution 4.3 


(a) 


(b) 


(c) 


The derivative of k(x) = sin(In 2) is 


W(2) = (cos(nz)) (2) 
Expanded version: In this case 
k(x) = sin(Inx) = g(f(x)). where 
u= f(z) =Ine and g(u) =sinu, Now 
f'(x) = 1/2 and g’(u) = cos u, so we have 


K(x) = g'(f(@))F'(@) 


_ cos(Inx) 
Se ae 


=g'(u)f'(z) (where u = f(x) =Inz) 
“on 
= cool e) (since u = Inz). 


The derivative of k(a) = sin’ x = (sinr)* is 
k(x) = (4(sin.x)*) (cosr) = 4 cosxrsin’ wv. 
Expanded version: In this case 
k(x) = (sinx)* = g(f(x)), where 
u= f(x) =sinz and g(u) = ut. Now 
f(x) =cosx and g/(u) = 4u%, so we have 
K(x) = 9 (f(a) f'(z) 
=q'(u)f'(x) (where u= f(x) =sinz) 
= (du*)(cos x) 
=4cosrsin’z (since u=sinz). 
The derivative of k(«) = eV” is 
ve 
TEEN CRA (| 
H0)= (*) (Fe) =e 
Expanded version: In this case 
k(x) = e¥* = g(f(x)), where 
Vr =«'/? and g(u) =e". Now 
~W/2 = 1/(2)/x) and g'(u) =e", so 


K(x) = 9'(f (a) f'(2) 


=g(u)f'(x) (where u = f(x) = V2) 
uy (2 

- (x2) 
VE 

= aE (since u = Vx). 


(d) The derivative of p(t) = In(e* + t) is 


(f) 


nae 1 t el 
r= Loar (sO eS me 
Expanded version: In this case 

p(t) = In(et + t) = g(f(t)). where 

u= f(t) =e'+¢ and g(u) =Inu. Now 
f(t) =e' +1 and g/(u) = 1/u, so we have 


P(t) =a (FOF (0) 
=9'(u)f'(t) (where u= f(t) =et +t) 


ie () (ef +1) 


e+1 
me (since u = e' +t). 


The derivative of g(s) = cos(1/s?) is 


a(s) = (-sin (4)) (-2)- HL 


Expanded version: In this case 

(8) = cos(1/s?) = g(f(s)). where 

= cosu. Now 
f'(s) = —28~8 = —2/s9 and g'(u) = —sinu. so 
we have 


1 (8) = 9'(f(s))f'(8) 
=g'(u)f'(s)_ (where u = f(s) = 1/s?) 


= (—sinu) (-3) 


2sin(1/s?) 
Sia 


(since u = 1/s*), 


The derivative of r(y) = 1/ cos? y is 


w= (-az) Cain = 2552. 


cos y cos? y 


Expanded version: In this case 
r(y) = 1/ cos? y = g(f(y)), where 
u= f(y) = cosy and g(u) = 1/u? = u-*. Now 
f'(y) = =siny and g'(u) = —2u-3 = —2/u3, so 
we have 


r'(y) =9'(f(u)) f(y) 
=a (u)f'(y) (where u = f(y) = cosy) 
= (-3) -siny 
_ 2siny 
=e 


(since u = cosy). 


(Note that this case involves the same 
constituent functions as in part (e), but applied 
in the reverse order.) 
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